Hacker News

I have 32GB of RAM with 16GB VRAM and I haven't had a lot of luck running larger models like this. Are you able to expand on that?

slim a day ago [ - ]

use llama.cpp with cuda

The problem may be that it's a 7800XT which handles memory contention by freezing.