logoalt Hacker News

kristjanssonyesterday at 12:50 AM1 replyview on HN

200 GB is an unfathomable amount of main memory for a CPU

(with apologies for snark,) give gpt-oss-120b a try. It’s not fast at all, but it can generate on CPU.


Replies

awestrokeyesterday at 7:27 AM

But it's incredibly incapable compared to SOTA models. OP wants high quality output but doesn't need it fast. Your suggestion would mean slow AND low quality output.

show 1 reply