slashdave • yesterday at 5:51 PM

RL is more than facts. Synthetic feedback is an obvious approach. Does the model suggest code that compiles and performs well?
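
To make the "compiles and performs well" check concrete, here is a minimal sketch of what such a synthetic reward could look like. Everything in it is an assumption for illustration, not something from the comment: the name `synthetic_reward`, the scoring scale (0.0 for failure, 0.5 to 1.0 for correct code depending on speed), and the `time_budget_s` parameter are all hypothetical. It also uses `exec` directly, which a real setup would replace with a sandbox.

```python
import time


def synthetic_reward(source, func_name, cases, time_budget_s=0.5):
    """Hypothetical reward for a model-suggested Python snippet.

    Returns 0.0 if the code does not compile, crashes, or gives a wrong
    answer; otherwise a value in [0.5, 1.0] that grows as the code gets faster.
    """
    # Step 1: does it compile?
    try:
        code = compile(source, "<candidate>", "exec")
    except SyntaxError:
        return 0.0

    # Step 2: does it run and produce the expected results?
    # NOTE: exec on untrusted code is unsafe; a real pipeline would sandbox this.
    namespace = {}
    try:
        exec(code, namespace)
        func = namespace[func_name]
        start = time.perf_counter()
        for args, expected in cases:
            if func(*args) != expected:
                return 0.0
        elapsed = time.perf_counter() - start
    except Exception:
        return 0.0

    # Step 3: does it perform well? Faster runs earn a larger bonus.
    speed_bonus = max(0.0, 1.0 - elapsed / time_budget_s)
    return 0.5 + 0.5 * speed_bonus


if __name__ == "__main__":
    cases = [((1, 2), 3), ((10, -4), 6)]
    print(synthetic_reward("def add(a, b): return a + b", "add", cases))
    print(synthetic_reward("def add(a, b) return a + b", "add", cases))
```

The point of the sketch is that none of the signal comes from a human label or a stored fact: the compiler, the test cases, and the clock provide the feedback.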
