This confused me at first as well: inactive experts skip compute, but their weights are still loaded, so memory usage does not shrink at all.
I found this visualisation helpful - https://vectree.io/c/sparse-activation-patterns-and-memory-e...
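To make the point concrete, here's a minimal sketch (my own toy example, not from the linked page) of a mixture-of-experts layer: all expert weight matrices are allocated up front, but each token's forward pass only multiplies through the top-k routed experts. The parameter counts at the end show why compute is sparse while memory is not.

```python
import numpy as np

rng = np.random.default_rng(0)

d_model, n_experts, top_k = 16, 8, 2

# ALL expert weights are allocated up front -- this is the resident memory cost.
experts = [rng.standard_normal((d_model, d_model)) for _ in range(n_experts)]
router = rng.standard_normal((d_model, n_experts))

def moe_forward(x):
    """Route one token vector through only its top-k experts."""
    scores = x @ router
    active = np.argsort(scores)[-top_k:]   # indices of the selected experts
    gates = np.exp(scores[active])
    gates /= gates.sum()                   # softmax over the k winners
    # Compute touches only k expert matrices...
    y = sum(g * (x @ experts[i]) for g, i in zip(gates, active))
    return y, active

x = rng.standard_normal(d_model)
y, active = moe_forward(x)

# ...but memory holds all n_experts of them regardless.
total_params = sum(w.size for w in experts)
active_params = sum(experts[i].size for i in active)
print(f"resident expert params: {total_params}")   # 8 * 16 * 16 = 2048
print(f"params used this token: {active_params}")  # 2 * 16 * 16 = 512
```

So with top-2 routing over 8 experts, only a quarter of the expert parameters are touched per token, yet all 2048 stay in memory.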