Hacker News

mike_hearn, today at 8:58 AM

You can disaggregate, though: draft models can run on cheaper hardware with less RAM, which saves time on the more expensive machines with more RAM.
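To make the split concrete, here is a toy sketch of greedy speculative decoding. It assumes the draft and target models are plain functions from a token prefix to the next token. All names here (`speculative_decode`, `greedy_decode`, `toy_target`, `toy_draft`) are made up for illustration, not taken from any real serving stack. In a disaggregated setup the `draft` calls would run on the cheap box and only the verification step would touch the big one. Real implementations verify all k positions in one batched forward pass and usually emit a bonus token when every draft token is accepted; this sketch skips both.

```python
from typing import Callable, List, Tuple

# Hypothetical model interface: maps a token prefix to the next token (greedy).
Model = Callable[[List[int]], int]


def greedy_decode(target: Model, prompt: List[int], max_new: int) -> List[int]:
    """Baseline: one target call per generated token."""
    tokens = list(prompt)
    for _ in range(max_new):
        tokens.append(target(tokens))
    return tokens


def speculative_decode(
    draft: Model, target: Model, prompt: List[int], k: int, max_new: int
) -> Tuple[List[int], int]:
    """Return (tokens, number of target verification rounds)."""
    tokens = list(prompt)
    target_rounds = 0
    while len(tokens) - len(prompt) < max_new:
        # Draft worker (cheap machine): propose k tokens autoregressively.
        proposal: List[int] = []
        for _ in range(k):
            proposal.append(draft(tokens + proposal))

        # Target worker (expensive machine): check the proposal.
        # Counted as one round, since a real system batches these k checks.
        target_rounds += 1
        accepted: List[int] = []
        for i in range(k):
            expected = target(tokens + proposal[:i])
            accepted.append(expected)  # always keep what the target wants
            if expected != proposal[i]:
                break  # first mismatch: discard the rest of the draft
        tokens.extend(accepted)
    return tokens[: len(prompt) + max_new], target_rounds


def toy_target(prefix: List[int]) -> int:
    # Toy "large" model: count upward modulo 10.
    return (prefix[-1] + 1) % 10


def toy_draft(prefix: List[int]) -> int:
    # Toy "small" model: agrees with the target except at some positions.
    return 0 if len(prefix) % 4 == 0 else toy_target(prefix)


if __name__ == "__main__":
    out, rounds = speculative_decode(toy_draft, toy_target, [0], k=4, max_new=8)
    print(out, rounds)
```

The output always matches plain greedy decoding with the target, because every kept token is one the target itself would have picked. The draft only changes how many times the expensive machine has to be visited.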