maybe gp's use of the word "lots" is unwarranted

0123456789ABCDE • yesterday at 8:54 PM • 1 reply • view on HN

https://artificialanalysis.ai indicates that sonnect 4.6 beats opus 4.6 on GDPval-AA, Terminal-Bench Hard, AA Long context Reasoning, IFBench.

conradkay • today at 3:47 AM

I was basing it off my recollection of this:

basically 9/13 are very close

alt Hacker News