This pelican is actually bad. Did you use xhigh?

Yep, just double-checked: I used gpt-5.4 xhigh. I had to select it in Codex, though, as I don't have access to it in the ChatGPT app or web version yet. It's possible that whatever code harness Codex uses messed with it.

This is proof they are not benchmaxxing the pelicans :-)