Oh yeah I did test various solutions and different settings and quants

Llama is about 1/3 slower on Apple Silicon.