Hacker News

sourcecodeplz today at 4:57 PM

It runs right now on Macs and PCs with 512 GB of RAM.


Replies

Our_Benefactors today at 8:16 PM

It runs like shit though in terms of tokens/second, and it still has a reduced context window. By comparison, a single Claude prompt can easily get into 300k tokens without breaking a sweat.

I want local AI to be a thing, but the hardware isn’t here yet, because the only options are a Mac Studio or DGX machines strapped together. RAM prices need to crash before local AI has a chance at actually competing.
