Hacker News

subroutine 15 hours ago

At 20 min per task you might as well code it yourself. Bill James needs to write a book on sabermetrics for LLM benchmarks.