Does it? Or is this a centaur situation where a competent human can fix it in about two minutes?

Yes! Sometimes when models get something wrong with less widely used programming languages, I like to just cancel the current inference, fix something myself, then tell the harness/model that I fixed the current problem, and to move on.

Define competent. This is the difference between having a product manager able to prototype and having a product manager need to work with an engineer.