This is literally what I'm waiting for. I want a ~8B model that works well with OpenClaw.
I don't think you'll get that anytime soon, because for a model to work well with something like OpenClaw it needs a massive context window.
but but but but unified memory! (jk, I don't actually believe in Apple marketing words)
There might be future optimizations. Like, have your small model do chain-of-thought (CoT) to figure out where to look for the relevant memory.
Qwen 9B doesn't?
Nothing is really usable outside Opus.
I've tried too. Wasted a few days trying out even high-end paid models.