If you use this as a rough gauge: https://openrouter.ai/models?order=top-weekly
Meta Llama 70B is around 50th on the list of popular models.
It has 24.1B tokens used over 7 days, versus hundreds of billions or trillions of tokens for the top models.
So practically dead!
Is that biased towards code generation, as opposed to application features using LLMs? I think the latter is more what we're talking about.