Hacker News

MrDrMcCoy · yesterday at 6:20 AM · 1 reply

I have both of those cards. Llama.cpp with SYCL has thus far refused to work for me, and Vulkan is pretty slow. Hoping that some fixes come down the pipe for SYCL, because I have plenty of power for local models (on paper).
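When SYCL "refuses to work," the first thing worth checking is whether the oneAPI runtime can see the GPUs at all; a rough sketch, assuming the oneAPI Base Toolkit is installed under the default `/opt/intel/oneapi` prefix (these commands only produce useful output on a machine with the Intel runtime present):

```shell
# Load the oneAPI environment (compilers, SYCL runtime, Level Zero).
source /opt/intel/oneapi/setvars.sh

# List every SYCL-visible device; the Arc cards should appear as
# Level Zero GPU entries. If they are missing here, llama.cpp's SYCL
# backend has nothing to run on, and the problem is drivers/libze,
# not llama.cpp itself.
sycl-ls
```

If the GPUs show up under OpenCL but not Level Zero, that usually points at a missing or mismatched `libze`/compute-runtime install rather than a llama.cpp bug.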


Replies

cmxch · yesterday at 3:40 PM

Hmm.

I had to rebuild llama.cpp from source with the SYCL and CPU specific backends.

Started with a barebones Ubuntu Server 24.04 LTS install, used the HWE kernel, pulled in the Intel dependencies for hardware support (oneAPI, libze), then built llama.cpp with the Intel compiler (icx?) for the SYCL and native (CPU-specific) backends.

In short, built it based mostly on the Intel instructions.
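A rough sketch of that build, following llama.cpp's own SYCL instructions; the exact package set and repo URL are assumptions, and it presumes the Intel oneAPI Base Toolkit and Level Zero (libze) are already installed:

```shell
# Put icx/icpx and the SYCL runtime on PATH.
source /opt/intel/oneapi/setvars.sh

git clone https://github.com/ggml-org/llama.cpp
cd llama.cpp

# Configure with the SYCL backend, the Intel compilers, and
# native CPU optimizations for the fallback CPU backend.
cmake -B build \
  -DGGML_SYCL=ON \
  -DCMAKE_C_COMPILER=icx \
  -DCMAKE_CXX_COMPILER=icpx \
  -DGGML_NATIVE=ON

cmake --build build --config Release -j
```

The key detail is configuring with the Intel compilers (`icx`/`icpx`) rather than GCC; the distro-packaged or generic prebuilt binaries typically ship without the SYCL backend compiled in, which would match the behavior described upthread.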