They say hill climbing | alt Hacker News

fooker • yesterday at 6:59 PM • 3 replies

They say hill climbing

https://microsoft.ai/news/building-a-hillclimbing-machine-la...

Unless they specifically clarify that the testing and training benchmarks are completely separate, we have to assume they test on the same 'hill' the model climbs.

Replies

artemisart • yesterday at 8:47 PM

Hill climbing doesn't mean much, but it absolutely doesn't imply they cheat on benchmarks. They have more details here: https://microsoft.ai/news/introducing-mai-thinking-1/ (it seems to be "RL on everything").

jongalloway2 • yesterday at 7:38 PM

[dead]

ajyoon • yesterday at 7:19 PM

[flagged]