ChatGPT is very good at code-reviewing Claude’s work and finds the howlers in it fairly reliably.

Playing different LLMs off against one another like that is a good way to expose some first-order errors.