Gemini 3 seems to have a much smaller output token limit than 2.5. I used to use Gemini to restructure essays into an LLM-style format to improve readability, but the Gemini 3 release was a huge step back for that particular use case.
Even when the model is explicitly instructed to pause when it runs low on output tokens rather than return an incomplete response, it still truncates the source text too aggressively, losing vital context and meaning in the restructuring.
I hope the 3.1 release includes a much larger output limit.
People did find Gemini very talkative, so it might be a response to that.
> Even when the model is explicitly instructed to pause due to insufficient tokens
Is there actually a chance it has the introspection to do anything with this request?
> Even when the model is explicitly instructed to pause due to insufficient tokens rather than generating an incomplete response
AI models can't do this, at least not from an instruction alone: the model never sees how much of its output budget remains, and the max-output-token cutoff is enforced server-side mid-generation, so there's nothing for the instruction to act on. You could approximate it with a custom 'agentic' setup where the caller detects the cutoff and asks for a continuation (see the sketch below).
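For anyone curious what that looks like, here's a minimal sketch of the continuation loop, assuming the google-generativeai Python SDK; the model name, API key, and prompts are placeholders:

```python
# Minimal sketch of the "agentic" workaround, assuming the google-generativeai
# Python SDK. The model name, API key, and prompt wording are placeholders.
import google.generativeai as genai

genai.configure(api_key="YOUR_API_KEY")
model = genai.GenerativeModel("gemini-2.5-pro")

def generate_full(prompt: str, max_rounds: int = 5) -> str:
    """Call the model repeatedly, stitching continuations after MAX_TOKENS cutoffs."""
    chunks = []
    request = prompt
    for _ in range(max_rounds):
        response = model.generate_content(request)
        chunks.append(response.text)
        # finish_reason tells us whether the server cut the answer off.
        if response.candidates[0].finish_reason.name != "MAX_TOKENS":
            break  # normal stop: the answer is complete
        # Truncated: re-prompt with the tail of the last chunk as an anchor.
        request = (
            prompt
            + "\n\nYour previous answer was cut off. Continue exactly where it "
            + "ended, without repeating anything. It ended with:\n"
            + chunks[-1][-500:]
        )
    return "".join(chunks)
```

Passing back the tail of the truncated chunk helps the model find its place, though you usually still have to clean up the seam. The point is that the loop lives outside the model, because the model itself has no way to honor "pause before you run out".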
The output limit has consistently been 64k tokens (including 2.5 Pro).
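If anyone wants to verify, the models endpoint reports each model's output cap. A quick check, assuming the google-generativeai SDK; the model names are examples:

```python
# Query the per-model output token cap, assuming the google-generativeai SDK.
import google.generativeai as genai

genai.configure(api_key="YOUR_API_KEY")
for name in ("models/gemini-2.5-pro", "models/gemini-2.5-flash"):
    info = genai.get_model(name)  # the models endpoint reports per-model limits
    print(name, "->", info.output_token_limit, "max output tokens")
```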