Loading...
Loading...
Olmo 3.1 32B Instruct is a large-scale, 32-billion-parameter instruction-tuned language model engineered for high-performance conversational AI, multi-turn dialogue, and practical instruction following. As part of the Olmo 3.1 family, this variant emphasizes responsiveness to complex user directions and robust chat interactions while retaining strong capabilities on reasoning and coding benchmarks. Developed by Ai2 under the Apache 2.0 license, Olmo 3.1 32B Instruct reflects the Olmo initiative’s commitment to openness and transparency.
Quality Index
12.2
326th of 442
Top 74%
Coding Index
5.6
302nd of 352
Top 86%
Price/1M
$0.30
329th cheapest
3% below median
Top 50%
Speed
54 tok/s
Top 46%
TTFT
0.47s
Context Window
66K
271st largest
Top 80%
Input
$0.20
per 1M tokens
Output
$0.60
per 1M tokens
Blended
$0.30
per 1M tokens
Cheaper than 50% of models. Median price is $0.31/1M tokens.
Daily
$0.30
Monthly
$9.00
54
tokens/sec
Faster than 54% of models
0.47
seconds
Faster than 46% of models
0.47
seconds
Faster than 49% of models
Market Median
46 tok/s
18% faster
Median TTFT
0.42s
12% slower
Throughput/Dollar
179
tok/s per $/1M
Speed Comparison
Context Window
66K
tokens
Larger than 20% of models