why comment on something you clearly don't know anything about? it's trained with on-policy RL, not just on coding text
listen and learn :)