logoalt Hacker News

culitoday at 4:34 PM0 repliesview on HN

Mistral is trash rn but plenty of OSS models are on the Pareto destribution of performance vs price

https://arena.ai/leaderboard/code?viewBy=plot

  model                     ELO   price
  claude-opus-4-7-thinking  1571  $20/M
  glm-5.1                   1534    3.65
  kimi-k2.6                 1529    3.24
  mimo-v2.5-pro             1479    2.50
  qwen3.6-plus              1470    1.54
  deepseek-v4-pro-thinking  1455    0.76
  deepseek-v3.2-thinking    1368    0.35
In fact it seems the pareto distribution is actually all open source Chinese models except for one spot