logoalt Hacker News

bonsai_spooltoday at 5:28 AM0 repliesview on HN

> On individual tasks Claude and GPT are comparable

That is not what the first graphs show - the Anthropic models cluster at 'better' positions on the graph, and I imagine you could show that the values are significantly different.