Hacker News

gavmor yesterday at 4:13 AM

The overall system that allowed this implementation is accountable. So why put such a fine point on it so as to exculpate the LLM?


Replies

im3w1l yesterday at 5:05 AM

It helps set expectations for the fix. "The bug was in an external system that has now been fixed" means it's probably fine going forward. "The LLM got tricked but we are gonna train it super hard not to do that again" means it will break again and again as people find new angles to convince it.