I started using brain -> workers approach for coding.

-- Brain is expensive smart model from claude subscription, Fable 5 when it was available, Opus now.

-- Worker is a local model (qwen3.6:46B), deployed in 36GB GPU, Opencode + Ollama.

Brain is responsible for analysis/design and task creation. Task should be made simple and clear so the worker can handle it. Worker does the coding. Brain validates and create a fix task when required. Atm fix to task ration is ~ 1:20.

If no available GPU at home - qwen3.6 is quite cheap on clouds.

Its rather experimental setup, out of curiosity, but it works better than I would expect it to. This allows me running 3 coding agents non stop for the 4-th day atm. Here I explain how I got there: https://news.ycombinator.com/item?id=48520757