Can't you know that tokens are units of thinking just by... like... thinking about how models work?
> Can't you know that tokens are units of thinking just by... like... thinking about how models work?
Seems reasonable, but this doesn't settle what are probably empirical questions, like: (a) to what degree is 'more' better? (b) how important are filler words? (c) how important are words that signal connection, causality, influence, or reasoning?
Can't you just know that the Earth is the center of the universe by... like... just looking at how the world works?