Hacker News

babelfish a day ago

Totally. Best-in-class for SWE work (until Mythos gets released, if ever, but I suspect the rumored "Spud" will be out by then too)

girvo a day ago

It really isn’t. I wish it was, because work complains about overuse of Opus.

jeswin a day ago

It really is, for complex tasks. Claude excels at low-to-mid complexity (CRUD apps, most business apps). For anything somewhat out of distribution, Codex at the moment has no peer.

ttul 18 hours ago

I find that more experienced devs are more likely to prefer Codex… anecdotal but… it’s a thing.

xvector 17 hours ago

This is because no one bothers to set thinking to high, as it now defaults to medium in CC.

Once you set thinking to high, it works just as well as 5.4, even for pretty complex tasks.

jeswin 16 hours ago

I have always used Claude at max thinking levels since it launched. It has never been up to the task. For clarity, the task is this: https://github.com/tsoniclang/tsonic

Meanwhile, there are half a dozen other projects (business apps, web apps etc) where it works well.