> Most research converges to the idea that RL on synthetic data makes models worse, not better.

marcosdumay • yesterday at 12:35 PM • 0 replies • view on HN

You are missing a mountain of nuance by generalizing the existence of a hole there.

alt Hacker News