The article's slides mention how much of an engineering challenge it is just for them to clean their data and create new hardware and software flows to use the data for training. So perhaps it is a big learning exercise to build up institutional / national knowledge of LLM creation.