the fact that more tokens = more smart should be expected given cot / thinking / other tec...

ShowalkKama • yesterday at 12:09 PM • 2 replies • view on HN

the fact that more tokens = more smart should be expected given cot / thinking / other techniques that increase the model accuracy by using more tokens.

Did you test that ""caveman mode"" has similar performance to the ""normal"" model?

Replies

Garlef • yesterday at 12:18 PM

Yes but: If the amount is fixed, then the density matters.

A lot of communication is just mentioning the concepts.

bitexploder • yesterday at 2:46 PM

That is part of it. They are also trained to think in very well mapped areas of their model. All the RHLF, etc. tuned on their CoT and user feedback of responses.

alt Hacker News

Replies