logoalt Hacker News

zx76yesterday at 7:41 PM4 repliesview on HN

I see a lot of people writing about how expensive the hardware to run these local models is - but see no mentions of the Intel Arc Pro B50/B60/B70 which seem like decent value if you're not interested in Apple kit (as much as anything can be decent value in the current status quo).

I just got a B70 with 32GB RAM for the equivalent of $1200 (incl. sales tax and import duties to my non-US location, so presumably it could be cheaper elsewhere). The memory bandwidth is 608 GB/s. For M5 Max (32-core GPU) it's 460 GB/s and for M5 Max (40-core GPU) it's 614 GB/s. A 3090 is still faster at ~900 GB/s but you're getting 32GB VRAM for a lot less than equivalent Nvidia cards. It's about 1/3 the bandwidth of a 5090 for 1/3 the cost, but with the same 32GB VRAM. If you're interested in being able to run bigger quants with some context and stay on a lower budget then it's an appealing trade off.

I'm still exploring using these local models so don't want to spend the equivalent of $5 000 - $10 000 just to test it out. I don't mind slightly slower perf to do some experimentation more affordably.

I actually got an B50 16GB (with meager 70w TDP!) first to test an Intel card with my stack - it worked easily with Ubuntu & Vulkan. I'd read a lot about hassles and people writing them off as unusable but it seems like these are often with SYCL which doesn't even seem to outperform vulkan and so why bother? (The B50 was just $370 inclusive tax and duties). Literally `apt install` the vulkan libraries and it worked with default xe driver in 26.04 and the vulkan build of llama.cpp. The SR-IOV PF/VF also just works with qemu/kvm, no tricks required. Since I got it fwupdmgr has updated the firmware twice so Intel is presumably actually trying to support these products.


Replies

androiddrewtoday at 3:53 PM

I try to always mention that AMD ROCm has come a long way. Like the B70 the Radeon AI Pro 9700 has 32GB of DDR6 640GB/s. Also $1300 a card. Very capable cards now in mid 2026. Great for dense models in the 30B range. I'd go strix halo or DGX spark if you want to run the 120B range of MOE models.

bblbtoday at 3:25 AM

I got B70 few days ago. Running on CachyOS. 9070XT on PCIe x16 and B70 on the x4.

ROCm nightly was pretty easy to setup and get up running. The 9070XT has been a decent card for my use cases.

But the SYCL ecosystem versions. Absolutely horrendous and everything is hundred commits behind. Vulkan is probably the only way forward with this card.

1-6today at 4:38 PM

A 3090 never came with 32gb of vram

kristianptoday at 3:16 AM

Interesting that Intels latest consumer GPUs only have 10 and 12GB respectively for the B570 and B580.