I don't think you will get that anytime soon, because a model needs a massive context window to work well with something like openclaw.

but but but but unified memory! (jk, I don't actually believe in Apple marketing words)

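There might be future optimizations, though. For example, have your small model do chain-of-thought (CoT) to figure out which parts of memory are relevant, and only load those instead of stuffing everything into the context.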
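Rough sketch of what I mean, not how openclaw or any real tool does it. All the names here (`select_memory`, `keyword_picker`, the memory keys) are made up for illustration, and `keyword_picker` is just a dumb keyword-overlap stand-in for the spot where the small model would reason about where to look:

```python
def keyword_picker(query, keys):
    """Hypothetical stand-in for the small model's 'where should I look?' step.

    Ranks memory keys by how many words they share with the query and
    drops keys with no overlap. A real setup would call a model here.
    """
    words = set(query.lower().split())
    scored = []
    for key in keys:
        key_words = set(key.lower().replace("_", " ").split())
        scored.append((len(words & key_words), key))
    # sorted() is stable, so ties keep their original order
    ranked = sorted(scored, key=lambda pair: -pair[0])
    return [key for score, key in ranked if score > 0]


def select_memory(query, memory, pick_keys=keyword_picker, budget_chars=2000):
    """Load only the memory entries the picker chose, within a size budget.

    memory: dict mapping a short key to the stored text.
    budget_chars: crude proxy for the context window (characters, not tokens).
    """
    chosen = []
    used = 0
    for key in pick_keys(query, list(memory)):
        text = memory.get(key)
        if text is None:
            continue  # picker named a key that doesn't exist
        if used + len(text) > budget_chars:
            continue  # doesn't fit, skip it and try the next one
        chosen.append((key, text))
        used += len(text)
    return chosen


memory = {
    "deploy_notes": "staging first, then prod",
    "user_preferences": "likes dark mode",
    "project_todo": "write tests",
}
picked = select_memory("what are the user preferences", memory)
```

The point is that the big context only ever sees `picked`, not the whole memory store, so the window can be a lot smaller.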