qwen3.5 9b runs okay on my 12GB gaming GPU. It's very stupid as a coding agent but it's possible to get useful work out of it.
I am experimenting with LFM2.5-8B-1A and getting 250 tokens/s on a 3060.