vidarh • today at 9:17 AM

Harness certainly matters a lot, though GLM is pretty forgiving. I just had Opus tell me that based on numbers over the last week, from quite a few billion tokens total across half a dozen providers, GLM 5.1 has been more reliable for one of my projects than Sonnet... Just switching on 5.2 now.

Replies

amosjyng • today at 10:54 AM

How are you collecting your metrics on token usage and reliability?

alt Hacker News
