logoalt Hacker News

simple10yesterday at 4:09 PM1 replyview on HN

The Ryzen AI Max 395 128gb is super cool, but not fast for inference. Order of magnitude slower than dedicated GPU but at half the cost. You can run larger models on it but it's slow. Great for local async work. Not great for daily chat or code agent driver.


Replies

throwa356262yesterday at 4:49 PM

The latest NPUs are pretty fast, I think what is missing is more optimised software support.

show 1 reply