does this run on CPUs as well? Anyone faced any issues? or do you prefer to run using APIs from model providers and aggregators such as openrouter, qubrid etc