How are they going to be competitive with top models at 70B size?
Qwen et al shows size isn’t actually the only useful metric for an llm.
Qwen et al shows size isn’t actually the only useful metric for an llm.