What throws me off is DeepSeek beating both Opus 4.8 and GPT 5.5.
That definitely doesn't sound right.