Hacker News

alargemoose, yesterday at 9:23 PM

I don’t care how practical it may or may not be; this is my new favorite LLM benchmark.