And it's a 4B model. I worry that nontechnical users will dramatically overestimate its accuracy and underestimate hallucinations, which makes me wonder how it could really be useful for academic research.
valid point. it's more of a stepping stone towards larger models. we're figuring out the best way to do this before scaling up.
If there's very little text before the internet, what would scaling up look like?