It might be that the system prompt Codex sends is not optimal for that model. Try the same task with OpenCode and see if your results improve.