Loading...
Loading...
For tasks that demand low latency, GPT‑4.1 nano is the fastest and cheapest model in the GPT-4.1 series. It delivers exceptional performance at a small size with its 1 million token context window, and scores 80.1% on MMLU, 50.3% on GPQA, and 9.8% on Aider polyglot coding – even higher than GPT‑4o mini. It’s ideal for tasks like classification or autocompletion.
Quality Index
13.0
306th of 442
Top 69%
Coding Index
11.2
244th of 352
Top 70%
Math Index
24.0
199th of 268
Top 75%
Price/1M
$0.17
277th cheapest
44% below median
Top 42%
Speed
149 tok/s
Top 14%
TTFT
0.36s
Context Window
1.0M
23rd largest
Top 7%
Input
$0.10
per 1M tokens
Output
$0.40
per 1M tokens
Blended
$0.17
per 1M tokens
Cheaper than 58% of models. Median price is $0.31/1M tokens.
Daily
$0.17
Monthly
$5.25
149
tokens/sec
Faster than 86% of models
0.36
seconds
Faster than 55% of models
0.36
seconds
Faster than 57% of models
Market Median
46 tok/s
226% faster
Median TTFT
0.42s
13% faster
Throughput/Dollar
849
tok/s per $/1M
Speed Comparison
Context Window
1.0M
tokens
Larger than 93% of models
Max Output
33K
tokens
3% of context