logoalt Hacker News

lemonish97yesterday at 6:56 PM1 replyview on HN

What is your evidence for this claim?


Replies

fookeryesterday at 6:59 PM

They say hill climbing

https://microsoft.ai/news/building-a-hillclimbing-machine-la...

Unless they specifically clarify that the testing and training benchmarks are completely separate, we have to assume they test on the same 'hill' the model climbs.

show 3 replies