simondotau • yesterday at 6:16 AM • 2 replies • view on HN

Presumably at some point the rapid progress of models will plateau, at least insofar as a model could be frozen in time and remain economically useful for the expected life of hardware. Especially if it comes with compelling benefits e.g. dramatically lower latency and/or dramatically higher performance per watt.

If you can build chips that could run one specific LLM 100x faster than anything else, it would have a use case that nothing else could match.

Replies

lsaferite • yesterday at 1:43 PM

Those Taalas chips apparently run at 1/10 the power of current SOTA GPU setups. If they can execute even partially on their plan, it'll be a literal game changer.

fragmede • yesterday at 7:38 AM

https://www.cerebras.ai/ is exactly that! Holy shit it's fast.
