TFA's benchmark was MLPerf, which doesn't require CUDA as Intel has their own Arc plugin. But actually try to run llama.cpp on Arc and it is a roll of the dice.