wongarsu • today at 8:05 AM

Tbf, most of the "real benchmarks" have issues that are just as bad. Assessing LLM performance is just hard.

oceansky • today at 1:13 PM

And personal too. Different engineers are using them for different use cases.
