Quantization alone does not explain it. It's mostly custom hardware[0].

[0] https://groq.com/the-groq-lpu-explained/