Quantization alone does not explain it. It's mostly custom hardware[0].
[0] https://groq.com/the-groq-lpu-explained/