
mdp2021 • yesterday at 5:03 PM • 0 replies

I read that IBM pioneered the concept of shifting, through "mid-training", from "guessing the next token" to "guessing the next logical step". I am wondering how far the research still has to go to get from "enhancing apparent reasoning" to "achieving solid, reliable reasoning".

If techniques existed to shift from "guess the next highly probable token" to "guess the best next logical step", as some have interpreted that research, should that not be the foremost objective?
