opus@max is on average worst than opux@xhigh

for supporting evidence, see first chart here: https://www.anthropic.com/news/claude-fable-5-mythos-5