The current Taalas chip is for a 3.1B param model. I really hope they can get that up to the 30B range. Just imagine Gemma 4 or Qwen 3.6 at 17k tps.

To clarify, Taalas' first chip is for a Llama 3.1 8B quant, not a 3.1B-parameter model.