Hacker News

Y

Hacker News

new | ask | show | jobs

kovezd 2 days ago [ - ]

By the composition of evals. Plus secondary metrics like parameter size, and token cost.

Not perfect, but useful.