Loading...
Loading...
Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance across common benchmarks compared to earlier Flash models. By default, "thinking" (i.e. multi-pass reasoning) is disabled to prioritize speed, but developers can enable it via the [Reasoning API parameter](https://openrouter.ai/docs/use-cases/reasoning-tokens) to selectively trade off cost for intelligence.
Quality Index
19.4
194th of 442
Top 44%
Coding Index
14.5
196th of 352
Top 57%
Math Index
46.7
144th of 268
Top 54%
Price/1M
$0.17
277th cheapest
44% below median
Top 42%
Speed
330 tok/s
Top 2%
TTFT
0.41s
Context Window
1.0M
8th largest
Top 6%
Input
$0.10
per 1M tokens
Output
$0.40
per 1M tokens
Blended
$0.17
per 1M tokens
Cheaper than 58% of models. Median price is $0.31/1M tokens.
Daily
$0.17
Monthly
$5.25
330
tokens/sec
Faster than 98% of models
0.41
seconds
Faster than 51% of models
0.41
seconds
Faster than 52% of models
Market Median
46 tok/s
623% faster
Median TTFT
0.42s
1% faster
Throughput/Dollar
1884
tok/s per $/1M
Speed Comparison
Context Window
1.0M
tokens
Larger than 94% of models
Max Output
66K
tokens
6% of context