Yes, I did see that section. We've known for a while that reward hacking, train/test data ...

mzelling • last Sunday at 5:50 PM • 0 replies • view on HN

Yes, I did see that section. We've known for a while that reward hacking, train/test data contamination, etc. must be taken seriously. Researchers are actively guarding against these problems. This paper explores what happens when researchers flip their stance and actively try to reward hack — how far can they push it? The answer is "very far."

alt Hacker News