Loading...
Loading...
Gemini 3.1 Flash Lite Preview is Google's high-efficiency model optimized for high-volume use cases. It outperforms Gemini 2.5 Flash Lite on overall quality and approaches Gemini 2.5 Flash performance across key capabilities. Improvements span audio input/ASR, RAG snippet ranking, translation, data extraction, and code completion. Supports full thinking levels (minimal, low, medium, high) for fine-grained cost/performance trade-offs. Priced at half the cost of Gemini 3 Flash.
Quality Index
33.5
77th of 442
Top 18%
Coding Index
30.1
81st of 352
Top 23%
Price/1M
$0.56
400th cheapest
82% above median
Top 59%
Speed
230 tok/s
Top 4%
TTFT
5.17s
Context Window
1.0M
8th largest
Top 6%
Input
$0.25
per 1M tokens
Output
$1.50
per 1M tokens
Blended
$0.56
per 1M tokens
Cheaper than 41% of models. Median price is $0.31/1M tokens.
Daily
$0.56
Monthly
$16.89
230
tokens/sec
Faster than 96% of models
5.17
seconds
Faster than 9% of models
5.17
seconds
Faster than 25% of models
Market Median
46 tok/s
404% faster
Median TTFT
0.42s
1136% slower
Throughput/Dollar
408
tok/s per $/1M
Speed Comparison
Context Window
1.0M
tokens
Larger than 94% of models
Max Output
66K
tokens
6% of context