Mistral Small 3 is a 24B-parameter language model optimized for low-latency performance across common AI tasks. Released under the Apache 2.0 license, it is available in both pre-trained and instruction-tuned versions designed for efficient local deployment. The model achieves 81% accuracy on the MMLU benchmark and is competitive with larger models such as Llama 3.3 70B and Qwen 32B, while running at three times their speed on the same hardware. [Read the blog post about the model here.](https://mistral.ai/news/mistral-small-3/)
**Pricing**

| Type | Price per 1M tokens |
| --- | --- |
| Input | $0.05 |
| Output | $0.08 |
| Blended | $0.06 |

At a blended $0.06 per 1M tokens, Mistral Small 3 is the 202nd cheapest model tracked: 81% below the median price of $0.31/1M and cheaper than 70% of models (top 30%).
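The blended figure sits between the input and output rates because it is a weighted average. The listed prices are consistent with the common 3:1 input-to-output weighting, though the page does not state its methodology; a minimal sketch under that assumption:

```python
# Sketch: deriving a "blended" per-1M-token price as a weighted average.
# The 3:1 input-to-output ratio is an assumption (a common convention on
# pricing comparisons), not a methodology confirmed by this page.

def blended_price(input_price: float, output_price: float,
                  input_weight: int = 3, output_weight: int = 1) -> float:
    """Weighted average of input/output prices per 1M tokens."""
    total_weight = input_weight + output_weight
    return (input_weight * input_price + output_weight * output_price) / total_weight

rate = blended_price(0.05, 0.08)     # Mistral Small 3 listed rates
print(f"${rate:.4f} per 1M tokens")  # $0.0575, displayed rounded as $0.06
```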
**Estimated cost**

| Period | Cost |
| --- | --- |
| Daily | $0.06 |
| Monthly | $1.72 |
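The page does not state the workload behind these estimates, but the figures are consistent with roughly 1M blended tokens per day at the unrounded $0.0575 blended rate (30 × $0.0575 ≈ $1.72). A sketch under that assumed volume:

```python
# Sketch: reproducing the daily/monthly estimates under an assumed
# workload of 1M blended tokens per day. Neither the volume nor the
# 30-day month is stated on the page; both are assumptions that
# happen to match the listed figures.

BLENDED_RATE = (3 * 0.05 + 1 * 0.08) / 4  # $0.0575 per 1M tokens (unrounded)
DAILY_TOKENS_MILLIONS = 1.0               # assumed reference volume

daily_cost = BLENDED_RATE * DAILY_TOKENS_MILLIONS
monthly_cost = daily_cost * 30

print(f"daily:   ${daily_cost:.2f}")    # $0.06
print(f"monthly: ${monthly_cost:.2f}")  # $1.72
```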
**Context window**

| Spec | Value |
| --- | --- |
| Context window | 33K tokens |
| Max output | 16K tokens (50% of the context window) |

At 33K tokens, the context window is larger than only 9% of tracked models (291st largest, top 91%).
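Because the maximum output is carved out of the same window, a prompt can only use whatever the reserved completion budget leaves over. A minimal budget check, assuming the rounded 33K/16K figures correspond to the usual 32,768- and 16,384-token limits:

```python
# Sketch: checking how many prompt tokens fit once a completion budget
# is reserved. The exact limits (32,768 / 16,384) are assumptions behind
# the rounded 33K / 16K figures shown above.

CONTEXT_WINDOW = 32_768
MAX_OUTPUT = 16_384

def max_prompt_tokens(completion_budget: int = MAX_OUTPUT) -> int:
    """Tokens left for the prompt after reserving the completion budget."""
    if not 0 < completion_budget <= MAX_OUTPUT:
        raise ValueError(f"completion budget must be in (0, {MAX_OUTPUT}]")
    return CONTEXT_WINDOW - completion_budget

print(max_prompt_tokens())       # 16384 prompt tokens at the full 16K output
print(max_prompt_tokens(1_024))  # 31744 prompt tokens for short completions
```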