How would it be a money grab? If the new tokenizer requires more tokens to encode the same information, it costs them more money for inference. The point of charging per token is that the cost is proportional to the number of tokens. That's my understanding anyway
Not necessarily, because of speculative decoding. Whitespace would be trivial to predict, so they would pretty much keep using the same amount of compute as before.
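To make the point concrete, here's a toy sketch of greedy speculative decoding. The "models" here are hypothetical stand-ins, not a real LLM API: a cheap draft model proposes a few tokens, and the expensive target model verifies the whole batch in one pass. Trivially predictable tokens like whitespace get accepted in bulk, so the target model runs far fewer forward passes than the number of tokens it emits.

```python
# Toy sketch of greedy speculative decoding with deterministic stand-in
# models (hypothetical, for illustration only).
TEXT = list("def f(x):\n        return x\n")  # whitespace-heavy output

def target_next(context):
    # Stand-in for the expensive target model: deterministic next token.
    return TEXT[len(context)] if len(context) < len(TEXT) else None

def draft_next(context):
    # Stand-in for the cheap draft model; assumed perfect on easy tokens
    # like whitespace (here it simply agrees with the target).
    return target_next(context)

def speculative_decode(k=4):
    context = []
    target_calls = 0
    while len(context) < len(TEXT):
        # The draft model cheaply proposes up to k tokens.
        proposal = []
        ctx = list(context)
        for _ in range(k):
            t = draft_next(ctx)
            if t is None:
                break
            proposal.append(t)
            ctx.append(t)
        # One target "forward pass" verifies the whole proposal; keep the
        # longest agreeing prefix, plus one bonus token if all matched.
        target_calls += 1
        for t in proposal:
            if target_next(context) == t:
                context.append(t)
            else:
                break
        else:
            bonus = target_next(context)
            if bonus is not None:
                context.append(bonus)
    return context, target_calls

out, calls = speculative_decode()
print(len(out), "tokens in", calls, "target passes")  # 27 tokens, 6 passes
```

So even if a whitespace-heavy tokenization inflates the billed token count, the expensive model's compute scales with verification passes, not raw tokens, when the draft model nails the easy ones.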
I don't think that's their primary motive for doing this, but it is a side effect.
Because everyone burns through their limits much faster, forcing them to upgrade to higher limits or new tiers.