You might find this helpful. llama is not anywhere near the Pareto distribution (performance vs cost)
https://arena.ai/leaderboard/code/webdev/pareto?license=open...
https://arena.ai/leaderboard/text/pareto?license=open-source
Llama3.1 instruct seems to be doing okay on that page, mostly because it's dirt cheap.
Llama3.1 instruct seems to be doing okay on that page, mostly because it's dirt cheap.