Everything I run, even the small models, some amount goes to the GPU and the rest to RAM.