logoalt Hacker News

coffeeritoday at 7:27 AM2 repliesview on HN

There is https://artificialanalysis.ai


Replies

XCSmetoday at 12:25 PM

There are many lists, but I find all of them outdated or containing wrong information or missing the actual benchmarks I'm looking for.

I was thinking, that maybe it's better to make my own benchmarks with the questions/things I'm interested in, and whenever a new model comes out run those tests with that model using open-router.

pplonski86today at 8:14 AM

Thank you! Exactly what I was looking for