Hacker News

>People will always use the heaviest model

Actually when doing my first attempt at vibe coding a few months ago, I found that Gemini Flash was fine for my tasks, and way faster than the heavier models. So I found the smaller model a vastly superior user experience.

The speed really adds up when you're using the autonomous coding agents, since they tend to require many LLM calls for a few simple changes.