Hacker News

maxloh yesterday at 10:10 PM

Other fully open LLMs include Allen AI's OLMo 3.1 and MBZUAI's K2 Think V2, both of which have released their full training pipelines and datasets.

Nvidia Nemotron is also an open training source model, though a portion of its dataset remains proprietary.

Quoting lambda's comment:

> Note that the Nemotron models are generally stronger than Olmo and K2 Think V2 (according to Artificial Analysis benchmarks), and there is a lot of overlap in their datasets (lots of datasets are based on the same sources with different filtering, Olmo and K2 Think V2 both have used some Nemotron datasets).

> But yeah, Nemotron is a modern and fairly capable LLM, even the 122b is more capable than Deepseek R1 (a 671b model) on most benchmarks, and there's also the recently released 550b Ultra now.

https://news.ycombinator.com/item?id=48492439


Replies

soundworlds today at 1:24 AM

Allen AI do not get enough love. They are doing GenAI how it should have always been done.

In fact, if the frontier companies had taken their approach, it would have started much slower, but I think we would be far more advanced by 2035. Instead we have a majority of society that wants to see AI fail.

typ today at 7:49 AM

> an open training source model

It's always funny to see people tempted to call open-blobs/open-weights, which are literally shareware like WinRAR or Adobe PDF Viewer, open source, and then need to invent a new term for what is actually open source.

vcryan today at 12:36 AM

Maybe I'll give Nemotron another try. Yesterday I used the latest one on OpenRouter and it was bad - worse than StepFun.