Hacker News

slopinthebag | yesterday at 6:26 PM | 1 reply

Claude Code gets smoked on benchmarks by an agent that has a single tool: tmux. So I think they might actually like that quite a bit.


Replies

HarHarVeryFunny | yesterday at 7:19 PM

What benchmarks are you referring to?