Totally. Best-in-class for SWE work (until Mythos gets released, if ever, but I suspect the rumored "Spud" will be out by then too)
Totally. Best-in-class for SWE work (until Mythos gets released, if ever, but I suspect the rumored "Spud" will be out by then too)
It really isn’t. I wish it was, because work complains about overuse of Opus.
It really is, for complex tasks. Claude excels at low-mid complexity (CRUD apps, most business apps). For anything somewhat out of the distribution, codex at the moment has no peer.
I find that more experienced devs are more likely to prefer Codex… anecdotal but… it’s a thing.
This is because no one bothers to set thinking to high, as it now defaults to medium in CC.
Once you set thinking to high it works just as well as 5.4 even for pretty complex tasks
I have always used Claude at max thinking levels since it launched. It has never been up to the task. For clarity, the task being this: https://github.com/tsoniclang/tsonic
Meanwhile, there are half a dozen other projects (business apps, web apps etc) where it works well.