After a point, does it matter? Even assuming that models can keep getting smarter indefinitely, it's not like you need to use the smartest model to get the job done.

Moore's law is dead now, so at some threshold purchasing the GPUs to run the biggest and newest model hurts you more than whatever rent you could've extracted from it.

When we get there, why would you want to run a closed model that you can't control, with restrictions you can't remove, that a company can take from you or silently nerf without telling you?