Mistral is trash rn but plenty of OSS models are on the Pareto destribution of performance vs price
https://arena.ai/leaderboard/code?viewBy=plot
model ELO price
claude-opus-4-7-thinking 1571 $20/M
glm-5.1 1534 3.65
kimi-k2.6 1529 3.24
mimo-v2.5-pro 1479 2.50
qwen3.6-plus 1470 1.54
deepseek-v4-pro-thinking 1455 0.76
deepseek-v3.2-thinking 1368 0.35
In fact it seems the pareto distribution is actually all open source Chinese models except for one spot