Looks like some psychology researchers got taken in by the ruse as well.
Yeah, I'm confused as well. Why would the models hold any memory of red-teaming attempts, or of how the training was conducted?
I'm really curious what the point of this paper is.