Hacker News

himata4113, today at 7:11 AM (5 replies)

The part that gets me about Anthropic's red lines is "of Americans". Okay, so the rest of the civilized world is up for grabs then? It's okay to destabilize allies with sabotaged tests (in machine learning) and data exfiltration outside America?

What gets me the most is that they claim the model should follow the https://www.anthropic.com/constitution and that it's embedded into the model. However, the system prompts in Claude Code and Cowork reiterate all of these points, and if they were embedded you shouldn't need to do that. And if you ask the API version of Claude to be a Hitler supporter, with enough prompt engineering it will become one, which directly contradicts what they claim to do. Opus 4.7 specifically will happily create anti-(insert minority group) propaganda. I haven't had the same success with 4.8 thus far, but I also haven't been motivated enough to push it in that direction yet, since I've been more interested in exploiting the cyber capabilities of the model.

My conclusion from the very start has been that Anthropic's strategy is pure optics, and considering the outpouring of support for the company, I think it has been very successful.


Replies

dminik, today at 9:31 AM

Yeah, it was funny seeing a bunch of people going like "Anthropic is fighting for privacy" meanwhile I'm like "Uhh, what about the other 8 billion people?"

On second thought, it's not funny.

nerdsniper, today at 7:16 AM

> The part that gets me about Anthropic's red lines is "of Americans". Okay, so the rest of the civilized world is up for grabs then? It's okay to destabilize allies with sabotaged tests (in machine learning) and data exfiltration outside America?

Regardless of Anthropic's "moral" position (inasmuch as a corporation can even have morals) against spying on non-Americans, they would have no way to enforce that limitation against the government because non-citizens outside of the USA have no protections from the intrusions of the US government.

oefrha, today at 1:46 PM

> anthropic red lines

Alleged red lines. Could be just talking points for garnering sympathy. Big tech aren’t exactly known for being truthful, especially big tech partnering with esteemed Palantir.

throwaw12, today at 5:45 PM

> The part that gets me about Anthropic's red lines is "of Americans". Okay, so the rest of the civilized world is up for grabs then?

And this is coming from a CEO who constantly claims moral superiority and advances the idea that China is bad.

avadodin, today at 9:25 AM

These companies are so good at selling their product's likely incompetence as possibly intentional subversion.