It's great that we are getting so many open source model releases, but I just feel like SOTA mo...

evilturnip • yesterday at 9:30 PM • 3 replies • view on HN

It's great that we are getting so many open source model releases, but I just feel like SOTA models will always be in the hands of the big players. The hardware requirement to achieve SOTA are just too steep.

My alternate universe would involve some sort of decentralized investing scheme to build data centers running massive open source models that could compete on some level with Anthropic, OpenAI, etc.

Replies

jazzyjackson • yesterday at 10:23 PM

There is the possibility of large model weights being exfil’d, either internally or maybe ChatGPT 6.2 will decide to escape its sandbox by ftp’ing itself to the internet archive*

* I heard from a public archive tour, that either OpenAI or Anthropic approached the organization as a partner to train on their materials (raw book scans and full web crawls for past 30 years) and the Archive was willing so long as the weights were shared in exchange. No dice!

LPisGood • today at 12:16 AM

Do we really care about this gap? If open models are 6 months to a year behind frontier models, does it really matter that much?

➕ show 3 replies

romanovcode • today at 9:06 AM

If they keep gatekeeping the SOTA models then who cares - not like you can use them anyway. So for general public the open models become the SOTA models sooner or later.

alt Hacker News

Replies