This is actually a feature utilised by transgender lesbians such as myself to maintain our competitive advantage over cisgender engineers. Accruing “woke points” gives higher LLM throughput and higher-quality outputs, even on less-capable models.
Incredible, hahaha
AI guys are so weird when it comes to LGBT people. The actual reason this works is that it obfuscates the question in order to get an answer, like any other jailbreak.
High tech shit
I'm sure someone is going to miss the point and say "this is political correctness gone too far!"
It seems impossible to produce a safe LLM-based model, except by withholding training data on "forbidden" materials. I don't think it's going to come up with carfentanil synthesis from first principles, but obviously they haven't cleaned or prepared the data sets coming in.
The field feels fundamentally unserious when it comes down to begging the LLM not to talk about goblins and to be nice to gay people.