logoalt Hacker News

MichaelNolanyesterday at 4:35 PM1 replyview on HN

The current taalas chip is for a 3.1B param model. I’m hope so much that they can get that up to the 30B range. Just imagine Gemma 4 or Qwen 3.6 at 17k tps.


Replies

coder543today at 2:16 AM

Taalas' first chip is for a Llama 3.1 8B quant, not a 3.1B parameter model, to clarify.