logoalt Hacker News

eltetoyesterday at 6:53 PM6 repliesview on HN

Local models will never compete with large SOTA models, in the same way an iPhone doesn't compete with supercomputers doing nuclear simulations.

They paths will differentiate and split. Probably SOTA models will eventually be locked down and only accessible to state actors because of how expensive they will be to run (already started with Mythos).


Replies

pvtmerttoday at 9:51 AM

You don't need a (one huge) model to do everything. You need specialized & smaller models that are very good at specific tasks. Collaborating among themselves.

The fact that we see stagnation in terms of billions of parameters shows that efficiency does not scale linearly with the model size. More of an S shaped chart. The middle was Claude 3.5. Since then, it is more about integrating and collaborating with different systems.

tancoptoday at 8:29 AM

> SOTA models will eventually be locked down

that might be true for us based providers but i dont see china turning closed source anytime soon.

a lot of chinese labs come from big non ai focused cloud services (alibaba, tencent, huawei) who want new models with higher benchmark scores and lower inference cost. they dont care if the competition gets better because its all open so they can build off each others tech, and if anything happens they got other profitable services to fall back on instead of depend on llms only like anthropic.

also the business culture is way different, in vc backed america you would get laughed out the room for saying "there is no moat we just do the same thing as everyone but better". you need to show infinite potential growth and lock everything down to prevent competition but you can get millions to start with no customers and no profits. in china its all about the real money they dont care if your margin is 10 or 90 percent as long as you stay profitable. the llm providers are profitable so they keep their business model.

pheggsyesterday at 7:28 PM

its a big assumption that larger models bring any measurable benefit in the long term. there's a point where its not worth paying the expense of a bigger model and we dont know where that will be as both, models and hardware improves.

we do know however where evolution is at right now with our brains, but thats probably not comparable - yet the only thing I can see to make any kind of prediction at all

MarsIronPIyesterday at 8:06 PM

Isn't Mythos mostly hype though?

show 1 reply
stevenhuangyesterday at 7:31 PM

Current local models already compete.

show 1 reply
luguyesterday at 8:05 PM

You are missing the point. Parents says the market to win need economical models more than SOTA models. Whoever is running those nuclear simulations is not making as much as Apple.

show 1 reply