Can you add Qwen 3.6 max to the leaderboard?

freely0085 • today at 4:29 AM • 1 reply • view on HN

Replies

We will as soon as API access is widely available. Once a model goes live, we typically have one-shot reasoning benchmarks up in ~8 hours and comprehensive agentic/combined benchmarks up after 24-48 hours. We're working on building relationships with each lab to have the results before launch.

alt Hacker News

Replies