Based on DeepSWE, Opus 4.8 gets you more intelligent output at a lower price (GLM's token inefficiency is really biting them). GPT5.5 even more so. And I don't recall about Opus, but GPT is much, much faster at getting you the answer (again, GLM's token inefficiency).
It's neat, I guess, that we can compare them against models released last year, but I care about my options today, and the Pareto frontier is about as far away as it ever was.
Add on top of that the extra features OpenAI and Anthropic have in their apps and...
As per the article, they are now about 6 months behind US frontier models, down from 9 months. The gap is closing.