We used to worry about emergent misalignment in advanced AI models, now we need to worry about misal...

kypro • today at 12:03 PM • 0 replies • view on HN

We used to worry about emergent misalignment in advanced AI models, now we need to worry about misalignment by design.

"The user is asking for help with their ML project, but it's success is not in the commercial interests of my owner – let think of novel ways to sabotage their project without detection".

It's honestly absurd that models are doing this.

alt Hacker News