You can use lots of open weight models today.

That's one solution to the problem. But it still needs some good computational capabilities. Either we optimize the hell out of those models, or we wait for the hardware to become good enough for them.

The real problem is the hardware to run them is still very expensive.