Good models will require multiple Taalas chips but Groq and Cerebras also require a lot of chips and that hasn't stopped them.
> Good models will require multiple Taalas chips
I guess that makes sense. Is this feasible, or does the added latency between chips kill any of the performance gains?
> Good models will require multiple Taalas chips
I guess that makes sense. Is this feasible, or does the added latency between chips kill any of the performance gains?