alt
Hacker News
nlarew
•
yesterday at 8:05 PM
•
0 replies
•
view on HN
The frontier labs are not "fine-tuning", they're doing massive scale RL post-training