As plenty of others have mentioned here, if inference were 100x cheaper, I would run 200x as much inference.
There are so many things you can do with long-running, continuous inference.
But what if you don't need to run it in the cloud?