If it's anything like ClaudeCode's ultracode, it's nothing new or revolutionary.
It's essentially a bunch of subagents being called by a deterministic script written by the main model thread, each eating tokens for lunch and output of which is synthesized by an orchestrator agent.
The fact that it's even named Ultra is pretty telling.
Ultra expensive
Confusion is: ultracode is not a different model with its own benchmarks
Neither is OpenaAI's ultra. Article specifically calls it 'mode' and it's not even mentioned in the model card.
It's for sure a codex harness feature.
EDIT: yeah, it's the same thing. https://github.com/openai/codex/blob/main/codex-rs/core/test...
>> If it's anything like ClaudeCode's ultracode, it's nothing new or revolutionary.
OpenAI flat out copying Anthropic is a pretty funny development. It's strong evidence that they've been in catch-up mode.
Eh, pretty much everyone that spent some time tweaking their harness already had a homemade 'ultracode' long before Anthropic did it.
OpenAI is just way more careful with what features they add or enable by default in their harness. Anthropic's harness is a junk drawer of random features, with a new feature added every few hours. It feels like they're in panic mode, dropping random things to see what sticks when models are eventually commoditized.
I prefer OpenAI way - slow and steady.