I think inference costs are now higher than training costs.
Nvidia is king of general-purpose training chips. But inference chips can be specialized.
What makes you think this? With wider adoption, the ratio will shift in favor of inference. And API price is becoming more important than SOTA capability.