I got about 7 tokens/sec generation on an M2 max macbook running 8-bit quant on an MLX version.