logoalt Hacker News

cyberclimbyesterday at 12:05 PM0 repliesview on HN

Note that these results are specific to gpt-4o so it's unclear how much they generalize.

They note at the end they're also testing "GPT o3, and Claude" but no empircal results are included.