This model is breaking records on my benchmark of choice, which is 'the fraction of Hacker News comments that are positive.' Even people who avoid Google products on principle are impressed. Hardly anyone is arguing that ChatGPT is better in any respect (except brand recognition).
ChatGPT 5.2 Thinking is significantly better in quality for most knowledge work, but it trades away speed.
That has been my experience, primarily because it is allowed to spend far more test-time tokens than Gemini 3.0 Pro on the same prompt.
And GPT costs 4x as much.
No offense, but that seems like a poor benchmark. These initial vibe checks are easily swayed by personal brand biases.
The brand bias is heavily against Google, not in Google's favor.
In the context of AI, I'm mostly seeing anti-OpenAI, pro-Google bias.
Facts. These HN threads are half astroturfing and paid shills. It's near impossible to decipher which takes are authentic unless they come from actual colleagues or people you know IRL.
Fair. No benchmark is perfect.
I do pay special attention to what the most negative comments say (which in this case are unusually positive), and to people discussing performance on their own personal benchmarks.
I don't know, ChatGPT seems to hallucinate a lot less.