My experience so far- much less reliable. Though it’s been in chat not opencode or antigravity etc. you give it a program and say change it in this way, and it just throws stuff away, changes unrelated stuff etc. completely different quality than pro (or sonnet 4.5 / GPT-5.2)
Been thinking of having Opus generate plans and then having Gemini 3 Flash execute. Might be better than using Haiku for the same.
Anyone tried something similar already?
So why Flash is so high in LiveCodeBench Pro?
BTW: I have the same impression, Claude was working better for me for coding tasks.