Hacker News

wittjeff 3 days ago [ - ]

I'm always happy to see more innovation in this area. It'd be great if you could make your model, weights, and training corpus public (preferably under a permissive license) on GitHub. It'd also be great if you could run some benchmarks against the other similar tools in this area (I'm thinking particularly of Mathpix, Equatio, and Microsoft's math OCR in OneNote, Word, and Azure APIs. If you make your test corpus and code available I could set up the benchmarks for you.

MayeulC 3 days ago [ - ]

I agree that it would be nice if the model was open weights and could run locally.

I have digitized almost all of my college handwritten notes, I would love to transcribe them, check them for errors, and contribute that as training data, but only for open weights models.

kragen 3 days ago [ - ]

I feel like an open weights and even open training data model is just about inevitable here, because a lot of people will feel the same way you do.