logoalt Hacker News

vidarhtoday at 9:17 AM1 replyview on HN

Harness certainly matters a lot, though GLM is pretty forgiving. I just had Opus tell me that based on numbers over the last week, from quite a few billion tokens total across half a dozen providers, GLM 5.1 has been more reliable for one of my projects than Sonnet... Just switching on 5.2 now.


Replies

amosjyngtoday at 10:54 AM

How are you collecting your metrics on token usage and reliability?

show 1 reply