logoalt Hacker News

SXXtoday at 9:44 AM2 repliesview on HN

Now we need someone try run Kimi K2.6 on old Xeon and DDR3. After all these platforms do support up to 768GB RAM.


Replies

segmondytoday at 3:24 PM

You can run these on a turing machine. At what point is it not worth it? At some point the energy to generate each token matters. We often seen token per second. I think a missing metric is tokens per kilowatt. That is what really matters.

Havoctoday at 11:16 AM

It’ll work but yield a token per minute. With ancient servers the throughput is the limiting aspect not mem size