I've been using 5.3-Codex. I cannot proof because it's subjective, but I have better results (you could say more reasonable) with it than 4.6 Opus.
GPT-5.4 one-shot a cross-language issue (a C++ repo + some amount of Lua), Opus kept hallutinating. This was debugging, not codegen.