Well, you can take the output of a first pass and pass it back through the model like AR “reasoning” models do at inference time.

Yes and has this been tried?

Yes, Mercury 2 is a reasoning model [0].

[0] https://docs.inceptionlabs.ai/get-started/models#mercury-2