I have been obsessed with the idea of this for a while, theres a Qwen with Opus reasoning distilled that works nicely as well. I think the next frontier is optimizing the models to be more capable on less hardware especially if it can learn on the fly.