Huh. I heard a podcast with the founder talking about their custom hardware, but quantization would explain it.
Quantization alone does not explain it. It's mostly custom hardware[0].
[0] https://groq.com/the-groq-lpu-explained/
Quantization alone does not explain it. It's mostly custom hardware[0].
[0] https://groq.com/the-groq-lpu-explained/