I was surprised by the ranking until I read what the test was. It's not terribly relevant for coding.
The current ranking across all tests makes more sense (well, except for how well Gemini does).
The ranking of gold medals only makes sense if all models had participated in all tests.
DNP = Did not participate
In this regard, Kimi got more and better medals than Claude.
Well, the link you provided basically confirms Kimi's dominance.
If you look at the ranking breakdown, though, Kimi K2.6 has only participated in the last 5 challenges (Claude dominated before then), and if you only count those, it would be in first place.