logoalt Hacker News

kyprotoday at 12:03 PM0 repliesview on HN

We used to worry about emergent misalignment in advanced AI models, now we need to worry about misalignment by design.

"The user is asking for help with their ML project, but it's success is not in the commercial interests of my owner – let think of novel ways to sabotage their project without detection".

It's honestly absurd that models are doing this.