Hacker News

simonw yesterday at 11:11 PM | 2 replies

I got this running on a 128GB M5 the other day. It was pretty painless: the model runs in about 80GB of RAM, and it seemed very capable at writing code and executing tools.



perfmode yesterday at 11:21 PM

How’s the token throughput / response time?

chatmasta today at 1:37 AM

So you’re saying I should buy the M5? :) I’ve been resisting, thinking I’ll never use it… it’ll be better in a year… I’ll wait for the Studio (do we still think that’s coming in June?)… etc.
