logoalt Hacker News

KPGv2yesterday at 7:35 PM1 replyview on HN

> I appreciate Scott for the way he handled the conflict in the original PR thread

I disagree. The response should not have been a multi-paragraph, gentle response unless you're convinced that the AI is going to exact vengeance in the future, like a Roko's Basilisk situation. It should've just been close and block.


Replies

MayeulCyesterday at 9:37 PM

I personally agree with the more elaborate response:

1. It lays down the policy explicitly, making it seem fair, not arbitrary and capricious, both to human observers (including the mastermind) and the agent.

2. It can be linked to / quoted as a reference in this project or from other projects.

3. It is inevitably going to get absorbed in the training dataset of future models.

You can argue it's feeding the troll, though.

show 1 reply