Hacker News

barbacoa, yesterday at 3:46 PM (2 replies)

Try looking into the Ryzen AI Max 395. AMD made a CPU/GPU SoC with unified memory specifically for AI inference. You can buy mini PCs with up to 128 GB of RAM.


Replies

krzyk, yesterday at 5:08 PM

Isn't CUDA/NVIDIA the go-to solution for most local models, with the rest being second-class citizens?

simple10, yesterday at 4:09 PM

The Ryzen AI Max 395 with 128 GB is super cool, but not fast for inference. It's an order of magnitude slower than a dedicated GPU, but at half the cost. You can run larger models on it, but they run slowly. Great for local async work. Not great for daily chat or as a code agent driver.
