jzymbaluk yesterday at 4:48 PM

You'd still need those giant data centers for training new frontier models. These Taalas chips, if they work, seem to do the job of inference well, but training will still require general-purpose GPU compute.


Replies

bonoboTP yesterday at 9:01 PM

Next up: wire up a specialized chip to run the training loop of a specific architecture.