logoalt Hacker News

15minutemailtoday at 7:25 AM1 replyview on HN

74% on LCB from a single 5060 Ti. I've been paying Anthropic per task and this guy is running it on electricity money, 20 minutes per task is rough for anything interactive though.


Replies

subroutinetoday at 7:55 AM

At 20 min per task you might as well code it yourself. Bill James needs to write a book on saber-metrics for LLM benchmarks.