logoalt Hacker News

Havoctoday at 8:52 AM1 replyview on HN

In general you’re mem bandwidth constrained so cpu vs gpu often ends up similar on APUs


Replies

fulafeltoday at 9:35 AM

There are ways to trade off compute power for memory bandwidth (like MTP and other speculative decoding approaches). The CPU and GPU would need to be able to share the same cache for this to work. In the Strix Halo case the GPU has a private cache on the GPU die I think, which is the snag.