Hacker News

I_am_tiberius yesterday at 10:27 PM

These guardrails are solely a pretext for using your data for training purposes. Every flagged message can be used for training.


Replies

Retr0id yesterday at 11:15 PM

This sounds backwards: any interrupted conversation becomes less useful for training.

tekacs today at 12:25 AM

> We will require 30-day retention for all traffic on Mythos-class models, on both first- and third-party surfaces. We won’t use this data to train new Claude models, or for any non-safety-related purpose

Whatever problem we might have with them, they explicitly say that they do not do this in the launch post.

wmf yesterday at 10:43 PM

If they can train the classifier to have fewer false positives that would be great.

autoexec today at 12:04 AM

I'd expect that everything they see gets used for training purposes (and data mining in general) regardless of whether it's flagged or not. It'd take a whistleblower for you to ever find out either way.

make3 today at 12:06 AM

This reasoning is inverted lol. They would get a lot more information by letting you use it. So much weird drama around reasonable guardrails for an experimental model.

Lord_Zero today at 2:59 AM

If we're doing conspiracy theories, what if fable is really dumb and not better than opus, and the guardrails hide that nicely? Meanwhile the hype train keeps chugging.