I was just working on a RAG implementation for >500k news articles, completely local, using postgres as a vector database: https://github.com/r-follador/TeletextSignals
I'm positively surprised on how well it works, especially if you also connect it to an LLM.