Likely reasoning is part of the original model. It is well known that it is not possible to get a 1b...

ashater • today at 4:53 PM • 0 replies • view on HN

Likely reasoning is part of the original model. It is well known that it is not possible to get a 1bn parameter model to reason, even with RL.

alt Hacker News