But it's incredibly incapable compared to SOTA models. OP wants high quality output but doesn't need it fast. Your suggestion would mean slow AND low quality output.
Set your parameters to make that point then. “Yeah just run a 1T+ model on CPU”
Set your parameters to make that point then. “Yeah just run a 1T+ model on CPU”