logoalt Hacker News

juliangoldsmithyesterday at 11:09 PM0 repliesview on HN

That benchmark ranks Kimi K2.6 and K2.7 Code near the bottom. Both are below Ornith 35B. It ranks Gemma 4 26B much higher than GLM-5.2. The results don't make much sense.