>People will always use the heaviest model
Actually, when I first tried vibe coding a few months ago, I found that Gemini Flash was fine for my tasks and way faster than the heavier models. So for me the smaller model was a vastly superior user experience.
The speed really adds up when you're using autonomous coding agents, since they tend to make many LLM calls even for a few simple changes.