Hacker News

freely0085, today at 4:29 AM, 1 reply

Can you add Qwen 3.6 max to the leaderboard?


Replies

gertlabs, today at 5:14 AM

We will as soon as API access is widely available. Once a model goes live, we typically have one-shot reasoning benchmarks up in ~8 hours and comprehensive agentic/combined benchmarks up after 24-48 hours. We're working on building relationships with each lab to have the results before launch.