Loading...
Loading...
GPT-5.4 mini brings the core capabilities of GPT-5.4 to a faster, more efficient model optimized for high-throughput workloads. It supports text and image inputs with strong performance across reasoning, coding, and tool use, while reducing latency and cost for large-scale deployments. The model is designed for production environments that require a balance of capability and efficiency, making it well suited for chat applications, coding assistants, and agent workflows that operate at scale. GPT-5.4 mini delivers reliable instruction following, solid multi-step reasoning, and consistent performance across diverse tasks with improved cost efficiency.
Quality Index
48.1
14th of 442
Top 3%
Coding Index
51.5
4th of 352
Top 1%
Price/1M
$1.69
531st cheapest
445% above median
Top 78%
Speed
230 tok/s
Top 4%
TTFT
2.28s
Context Window
400K
41st largest
Top 16%
Input
$0.75
per 1M tokens
Output
$4.50
per 1M tokens
Blended
$1.69
per 1M tokens
Cheaper than 22% of models. Median price is $0.31/1M tokens.
Daily
$1.69
Monthly
$50.64
230
tokens/sec
Faster than 96% of models
2.28
seconds
Faster than 12% of models
2.28
seconds
Faster than 27% of models
Market Median
46 tok/s
405% faster
Median TTFT
0.42s
445% slower
Throughput/Dollar
136
tok/s per $/1M
Speed Comparison
Context Window
400K
tokens
Larger than 84% of models
Max Output
128K
tokens
32% of context