logoalt Hacker News

segmondytoday at 3:39 PM1 replyview on HN

Pretty cool, what they need is to build a tool that can take any model to chip in short a time as possible. How quick can they give me DeepSeek, Kimi, Qwen or GLM on a chip? I'll take 5k tk/sec for those!


Replies

throwaw12today at 3:47 PM

also imagine it will cost 300$/unit, we all will host our own set of models locally, dream dream