Hacker News

Y

Hacker News

new | ask | show | jobs

alargemoose 3 hours ago [ - ]

I don’t care how practical it may or may not be, this is my new favorite LLM benchmark