logoalt Hacker News

minimaxiryesterday at 5:34 PM0 repliesview on HN

Due to the increasing difficulty of scaling up training, it appears the gains are instead being achieved through better model training which appears to be working well for everyone.