If the model "improves" every 5 hours, how do you have any guarantee of model consistency across long coding sessions?

Yeah, this feels tricky.

If the model changes every few hours, we’re basically debugging against a moving target - and that gets expensive fast.

What do you need consistency for?
