logoalt Hacker News

jitlyesterday at 8:03 PM1 replyview on HN

wat


Replies

0123456789ABCDEyesterday at 8:54 PM

maybe gp's use of the word "lots" is unwarranted

https://artificialanalysis.ai indicates that sonnect 4.6 beats opus 4.6 on GDPval-AA, Terminal-Bench Hard, AA Long context Reasoning, IFBench.

see: https://artificialanalysis.ai/?models=claude-sonnet-4-6%2Ccl...