wooger • last Friday at 11:50 AM • 1 reply

> unless I guess they routinely index this repo

This sounds like exactly the kind of thing any tech company would do when confronted with a competitive benchmark.

Replies

I mean, the repo has <200 stars; it's not so mainstream that you'd expect LLM makers to be actively watching it. If they wanted to game it, they could more easily do that in RL with synthetic data anyway.
