segmondy • today at 3:39 PM • 1 reply • view on HN

Pretty cool. What they need is to build a tool that can take any model to a chip in as short a time as possible. How quickly can they give me DeepSeek, Kimi, Qwen or GLM on a chip? I'll take 5k tokens/sec for those!

Replies

throwaw12 • today at 3:47 PM

Also, imagine it costs $300/unit: we'd all host our own set of models locally. Dream, dream.
