One of my favorite code comments of all time is still in the src:
"# TODO: implement a proper validator to compare against ground truth. For now we just check for exact string match on each line of stdout." [1]
This was one of my chief complaints about the entire R1 news cycle, it felt like no one actually read the technical report. They were being heralded for their openness, but they left out the most meaningful details that you'd need to reproduce their work.
[1] https://github.com/huggingface/open-r1/blob/1416fa0cf21595d2...
> This was one of my chief complaints about the entire R1 news cycle
For me it was the headline that a group of students replicated GPT-3 for $5000
Reminds me of my days in a computational physics PhD program.