My M5 Pro is getting ~11 tokens per second via OMLX for an 8-bit quant.