> Another version of this issue is when you push back but you were NOT "right to push back". In other words, the LLM original solution was better than the pushback.

Indeed, it's easy to surface this by sending one model a "Review" of their proposal to another, then bounce them back and forward, ask which one is best and both models will almost always say something like "The other proposal/review is better", I'm guessing because somehow they think it comes from the human, and "human is always right" or something.