logoalt Hacker News

BugsJustFindMetoday at 1:16 AM1 replyview on HN

Yes. "Show Code", not "Show CPU cycles". There's a difference. Writing code is not the same as running code. It looks to you like it ran the code. But you have no proof that it did. I've seen many times LLM systems from companies that claimed that their LLMs would run code and return the output claiming that they ran some code and returned the output but the output was not what the shown code actually produced when run.


Replies

xVeduntoday at 5:22 AM

Maybe the only way to be sure is to have it generate (not stable diffuse) an image with the value in there.

show 1 reply