Wow, I can even chat about C code with that model in LM Studio on my MacBook at 200 tokens per second

Haha, we never trained it for chat but I would bet it works regardless.

Also that's crazy, M4 Mac?

M4 Max 128GB yeah