logoalt Hacker News

smusamashahyesterday at 9:03 PM0 repliesview on HN

I only want to see how it performs on the Bullshit-benchmark https://petergpt.github.io/bullshit-benchmark/viewer/index.v...

GPT is not even close yo Claude in terms of responding to BS.