Hacker News

orbital-decay yesterday at 8:04 PM

Training on the CoT itself is pretty dubious since it's reward-hacked to some degree (as is evident from, e.g., GLM-4.7, which tried pulling that with 3.0 Pro and ended up repeating Model Armor injections without really understanding or following them). In any case they aren't trying to hide it particularly hard.


Replies

FergusArgyll yesterday at 8:12 PM

> In any case they aren't trying to hide it particularly hard.

What does that mean? Are you able to read the raw CoT? How?