Very interesting, I still need to test it more, but it seems like it parses only the arxiv PDF data. For getting more accurate equations it might be a good idea to download the original tex source and let it parse that (maybe even both).
EDIT: Another thought: maybe the output could also support markdown/latex like chatgpt.
Thank you, that's probably correct. I think the gemini api might turn the pages into images and use those. Sending the original tex source was something I thought of but not all papers have those submitted.
As for markdown / latex output that could be done, especially for equations! I'll have to look into the best way to render that.