I believe until the hardware designs catch up to be more commodized ala cryto mining evolution from GPUs to ASICS for specfic algos. Designs (like Google TPUs equivalent) would also need to evolve to be more memory dense to be able to handle them. Untill then it seems will be system time shares for the larger models , probably with a bring your own model and pay as you go.
> ala cryto mining evolution from GPUs to ASICS for specfic algos
I don't see it happening. A current gen GPU with a huge and fast block of memory isn't a perfect fit for these algorithms but it's relatively close. With cryptocurrency, mass small sha256 hashing was a totally different kind of computation.