Hacker News

amosjyng yesterday at 10:54 AM

How are you collecting your metrics on token usage and reliability?


Replies

vidarh yesterday at 12:21 PM

They are from my own runs, with reliability measured in terms of passing extensive test suites. So the caveat is that this applies to my specific use and might well vary greatly.