Yeah, right. If this benchmark was truly developed in an independent manner, and the timing just “lined up”, how did Anthropic even know to include results in their model release documentation the day after the benchmark is revealed? It seems like there must have been some collaboration or influence from Anthropic behind the scenes.

Come on, why are you a jerk about this?

Nobody would have 800+ billion reasons to lie by commission or omission here.