yberreby • last Thursday at 8:53 PM • 1 reply

Would you share some additional details? CPU, amount of unified memory / VRAM? Tok/s with those?

cc62cf4a4f20 • last Friday at 5:10 AM

MBP M4 Max, 64 GB. I haven't measured the tokens/sec; it feels slower than Claude, but not unbearably so.

It's not perfect yet; my sense is just that it's near the tipping point where models are efficient enough that running a local model is truly viable.
