logoalt Hacker News

asaryesterday at 6:04 PM5 repliesview on HN

$1.5/m input tokens $9/m output tokens

6x the price of 3.1 flash lite


Replies

Auncheyesterday at 7:09 PM

"Flash-Lite" is a different product from "Flash", which is more expensive. They couldn't be more confusing with their naming though, especially since they have 3.1 Pro and not 3.1 Flash non-lite.

WarmWashyesterday at 6:40 PM

I haven't used 3.5 at all yet, but previous Gemini (and Gemma models) are by far the most token light per task than any other model.

Cost per task is a more productive measure, but obviously a more difficult one to benchmark.

iwhalenyesterday at 6:09 PM

I wonder why they didn't discuss price in the post?

Compare to the GPT-5.5 announcement: https://openai.com/index/introducing-gpt-5-5/

himata4113yesterday at 6:07 PM

I don't think input/output pricing matters, 90% of the cost is cache. $0.15 is pretty good, but still very expensive.

show 4 replies
John7878781yesterday at 6:07 PM

[deleted]

show 1 reply