Hacker News

nlarew, yesterday at 8:05 PM

The frontier labs are not "fine-tuning"; they're doing massive-scale RL post-training.