It sounds like you might not have enough VRAM to load the entire model at once. You might want to try https://github.com/AlexsJones/llmfit
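
If you want a quick sanity check first, here is a rough back-of-the-envelope sketch of the "does it fit" arithmetic. It only counts the weights (parameter count times bytes per parameter) and then pads the result with a made-up `overhead_factor` of 1.2 for KV cache, activations, and runtime buffers. That factor and both function names are my own assumptions for illustration, not anything taken from llmfit, and real usage varies with context length and backend.

```python
def estimate_weight_gib(n_params: float, bits_per_param: int) -> float:
    """Rough size of the model weights alone, in GiB."""
    return n_params * bits_per_param / 8 / 1024**3


def fits_in_vram(
    n_params: float,
    bits_per_param: int,
    vram_gib: float,
    overhead_factor: float = 1.2,  # assumed padding, not a measured value
) -> bool:
    """Return True if the padded weight estimate is within the VRAM budget."""
    needed_gib = estimate_weight_gib(n_params, bits_per_param) * overhead_factor
    return needed_gib <= vram_gib


if __name__ == "__main__":
    # Hypothetical example: a 7B-parameter model on an 8 GiB card.
    for bits in (16, 8, 4):
        size = estimate_weight_gib(7e9, bits)
        ok = fits_in_vram(7e9, bits, vram_gib=8)
        print(f"{bits}-bit: ~{size:.1f} GiB of weights, fits in 8 GiB: {ok}")
```

Treat the result as a first guess only. If the estimate is close to your card's limit, a smaller quantization or partial offloading is probably worth trying.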

It's certainly part of the problem. Thanks, I'll give that a shot.