It’s possible they’re using some new architecture to get more up-to-date data, but I think that’d be even more of a headline.
My hunch is that this is the same 5.1 post-training on a new pretrained base.
Likely rushed out the door faster than they initially expected/planned.