> Smaller models really aren't great at structured output.

That doesn't seem to hold true. Consider gpt-5.4-nano, which supports structured output just fine.

https://developers.openai.com/api/docs/models/gpt-5.4-nano

It seems like a concern that's orthogonal to model size.

I genuinely doubt that they are just lying though lol