I'd love to see a tokens-per-second metric. I always prioritize speed over raw intelligence for flash models.
> I always prioritize speed over raw intelligence for flash models.
This model might have a perfect speed:
import random
for _ in range(100): print(random.choice(words))  # words: any list of strings