Comparing only on SOTA scores (ignoring price etc.) is like choosing your daily driver by looking at who makes the fastest sports car...
Not really. SOTA vs. non-SOTA is "can I actually get my coding work done today" vs. "this can do customer support chat".
It is like car vs. kick scooter.
The constant improvements in SOTA are the main thing keeping the investment machine running. We can't really separate training costs from inference costs, because a lot of the funding and loans for the inference hardware only exist because of the promises that continuous training provides (or tries to provide).