> The blog post implies that it currently requires 96GB of VRAM.
Has anyone tested what happens if you try to run this on lower-RAM Macs? It might work and just be a bit slower as it falls back on fetching model layers from storage.
It'd be way slower, since you'd be re-fetching those layers from storage for every token generated.