That's cool! I did the same for the Quran to see how RAG works. I also indexed the related works known as the "Hadith" and the names of Allah. It first required indexing everything using OpenAI embeddings, and the search is then powered by those embeddings.

https://reminder.dev/search

It's also open source

https://github.com/asim/reminder
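
For anyone curious what the "index with embeddings, then search" step looks like, here is a minimal sketch. It is not the code from the repo above. The `embed` function is a toy bag-of-words stand-in so the example runs offline; a real setup would call an embedding model there instead. The names `build_index` and `search` are made up for illustration.

```python
import math
from collections import Counter


def embed(text):
    # ASSUMPTION: toy stand-in for a real embedding model.
    # It just counts lowercase whitespace-separated tokens.
    return Counter(text.lower().split())


def cosine(a, b):
    # Cosine similarity between two sparse count vectors.
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def build_index(passages):
    # Embed every passage once, up front, and keep text and vector together.
    return [(p, embed(p)) for p in passages]


def search(index, query, k=2):
    # Embed the query and return the k most similar passages.
    q = embed(query)
    ranked = sorted(index, key=lambda item: cosine(q, item[1]), reverse=True)
    return [p for p, _ in ranked[:k]]
```

In a RAG setup the passages returned by `search` would then be pasted into the prompt as context for the model's answer.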

Yes! Looks like yours has more features (I considered generating the audio). FWIW, I observed that the embedding gives the most signal when I pass whole paragraphs (for all I know the Quran is in lyrical verse, so maybe that doesn't work).

Ah interesting. So I'm using an English translation. Tbh I wish I could do more with the Arabic but the models are not perfect at that. But the idea of giving it whole paragraphs makes a ton of sense.
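
One way to try the whole-paragraph idea on verse-structured text is to group consecutive verses into overlapping windows before embedding them. This is only a rough sketch: `chunk_verses`, the window `size`, and the `overlap` are hypothetical names and values picked for illustration, not something either project is known to use.

```python
def chunk_verses(verses, size=4, overlap=1):
    # Group consecutive verses into paragraph-sized chunks.
    # `overlap` verses are repeated between neighbouring chunks so that
    # context spanning a chunk boundary is not lost.
    if size <= 0 or not 0 <= overlap < size:
        raise ValueError("need size > 0 and 0 <= overlap < size")
    step = size - overlap
    chunks = []
    for start in range(0, len(verses), step):
        chunks.append(" ".join(verses[start:start + size]))
        if start + size >= len(verses):
            break  # the last window already reached the end
    return chunks
```

Each chunk would then be embedded as a single unit instead of embedding one verse at a time.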

I tried to generate audio but wasn't satisfied with the interpretation of some words. That said, I may try again.
