Added Qwen3 Next to the Brokk Power Ranking Open Round (coding benchmark). It's roughly GPT-OSS...

jbellis • yesterday at 9:30 PM • 2 replies • view on HN

Added Qwen3 Next to the Brokk Power Ranking Open Round (coding benchmark). It's roughly GPT-OSS-20b strength.

SparkyMcUnicorn • yesterday at 11:18 PM

This would be a valuable benchmark if it included languages other than Java, and let me see which models are best at the languages I work with.

My real-world usage does not line up with these results, but I'm not working with Java.

noahbp • yesterday at 9:46 PM

Is that the updated Kimi K2, or the old Kimi k2?

alt Hacker News