logoalt Hacker News

mocamocayesterday at 8:17 AM1 replyview on HN

Would you mind explaining your point a view? Or point me to ressources making you think so?


Replies

nradovyesterday at 1:29 PM

What can be asserted without evidence can also be dismissed without evidence. The benchmark creators haven't demonstrated that higher scores result in fewer humans dying or any meaningful outcome like that. If the LLM outputs some naughty words that's not an actual safety problem.