Worse by every measure.

What metrics are you looking at? Grok 4 outperforms Claude 4 Opus in the Artificial Analysis Intelligence Index.

https://artificialanalysis.ai/leaderboards/models