Hacker News

bellowsgulch today at 4:39 PM, 3 replies

*Anthropic apologizes they got caught defending their moat by implementing invisible Claude Fable guardrails


Replies

simonw today at 4:50 PM

If by "got caught" you mean "published it in their system card paper".

(Admittedly it was buried pretty deep in that 300+ page PDF, but they did at least disclose it. If they hadn't I imagine it would have taken quite some time for the research community to figure out what was going on.)

afthonos today at 4:53 PM

They didn’t get caught; they explicitly said they would do that in the announcement. I think it was both a bad and a weird idea, but it certainly wasn’t sneaky.

cyanydeez today at 4:40 PM

is it a moat or just a way to implement the permanent underclass?