Huh. I heard a podcast with the founder talking about their custom hardware, but quantization would explain it.

Quantization alone does not explain it. It's mostly custom hardware[0].

[0] https://groq.com/the-groq-lpu-explained/