Someone recently made a graph showing that the gap between US frontier LLMs and Chinese open-weight LLMs (including DeepSeek v4) is widening. Unfortunately I can't find it anymore.

Update: GPT-5.5 found it.

Article: https://www.nist.gov/news-events/news/2026/05/caisi-evaluati...

Graph: https://www.nist.gov/sites/default/files/images/2026/05/01/1...

Give it time. It's inevitably a logistic curve.

I believe logistic curves make no sense when you have Elo scores.
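For context, here is a minimal sketch of the standard Elo expected-score formula. The 400-point scale is the conventional chess choice, and whether the linked evaluation uses the same scale is an assumption on my part. The function name is just illustrative. It shows that only rating differences matter and that the rating scale has no built-in ceiling, which may be why fitting a saturating curve to raw Elo values is questionable:

```python
def elo_expected_score(rating_a: float, rating_b: float) -> float:
    """Expected score of A against B under the standard Elo model.

    Assumes the conventional base-10, 400-point scale.
    """
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400.0))


# Equal ratings give an even match.
print(elo_expected_score(1500, 1500))

# A 400-point lead gives the same expected score at any absolute level,
# because only the difference between the two ratings enters the formula.
print(elo_expected_score(1900, 1500))
print(elo_expected_score(2900, 2500))
```

Note that the win probability is itself a logistic function of the rating gap, but the ratings themselves can keep growing without bound.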

This is propaganda, not data.

If the Chinese government published a graph that said the opposite, would you consider that a serious and objective source?

If the methodology in the accompanying write-up looked credible, yes. That said, I have significantly more trust in US agencies, like NIST in this case.

That is an official website of the United States government. I would prefer another source.

I don't think any other source exists.