Hacker News

noelsusman, yesterday at 6:46 PM

The Artificial Analysis benchmark results are pretty underwhelming: roughly the same "intelligence" as MiMo-V2.5-Pro for over 3x the cost. We'll have to see how that translates to actual usage, but it's not a great sign.


Replies

hydra-f, yesterday at 7:26 PM

That really depends on whether they have similar parameter counts, doesn't it? Unless you know that, the comparison is just strange.
