Makes you wonder if its possible to squeeze more tps out of a strix halo system using the 16 zen5 co...

christkv • today at 8:04 AM • 2 replies • view on HN

Makes you wonder if its possible to squeeze more tps out of a strix halo system using the 16 zen5 cores as well as the gpu.

Replies

Havoc • today at 8:52 AM

In general you’re mem bandwidth constrained so cpu vs gpu often ends up similar on APUs

➕ show 1 reply

cafkafk • today at 8:20 AM

If you get the inference engine to route the heavy matrix math to the GPU and the speculative drafting to the CPU without choking on latency it's probably gonna be very fast.

Would love to see the benchmarks if someone actually pulls something like that off.

alt Hacker News

Replies