Think past on-device inference: imagine what on-device training could do. That would need a lot of RAM, though, since training has to keep gradients and optimizer state in memory on top of the weights.