logoalt Hacker News

blharrtoday at 3:44 AM0 repliesview on HN

"The model refuses to follow my specific word detail prompts" and "The model refuses to perform hacking attempts" are on the same side of the model refusing to do something baked into it though.