Hacker News

cubefox 7 hours ago

Someone recently made a graph showing that the gap between US frontier LLMs and Chinese open-weight LLMs (including DeepSeek v4) is widening. Unfortunately I can't find it anymore.

Update: GPT-5.5 found it.

Article: https://www.nist.gov/news-events/news/2026/05/caisi-evaluati...

Graph: https://www.nist.gov/sites/default/files/images/2026/05/01/1...

mordae 5 hours ago

Give it time. It's inevitably a logistic curve.

cubefox 4 hours ago

I believe logistic curves make no sense when you have Elo scores.

tirpen 5 hours ago

This is propaganda, not data.

If the Chinese government published a graph that said the opposite, would you consider that a serious and objective source?

cubefox 4 hours ago

If the methodology in the accompanying write-up looked credible, yes. Though I have significantly more trust in US agencies, like NIST in this case.

lugu 5 hours ago

That "someone" is an official website of the United States government. I would prefer another source.

cubefox 5 hours ago

I don't think any other source exists.