Your statement assumes training data is the only thing that matters for the big players, while not considering it limiting for the small Norwegian model. That’s a fallacy.
Nowhere in the article does it say the Norwegian LLM will train _only_ on Norwegian data.