Loading...
Loading...
Qwen3-8B is a dense 8.2B parameter causal language model from the Qwen3 series, designed for both reasoning-heavy tasks and efficient dialogue. It supports seamless switching between "thinking" mode for math, coding, and logical inference, and "non-thinking" mode for general conversation. The model is fine-tuned for instruction-following, agent integration, creative writing, and multilingual use across 100+ languages and dialects. It natively supports a 32K token context window and can extend to 131K tokens with YaRN scaling.
Price/1M
$0.14
252nd cheapest
56% below median
Top 37%
Context Window
41K
285th largest
Top 82%
Input
$0.05
per 1M tokens
Output
$0.40
per 1M tokens
Blended
$0.14
per 1M tokens
Cheaper than 63% of models. Median price is $0.31/1M tokens.
Daily
$0.14
Monthly
$4.12
Context Window
41K
tokens
Larger than 18% of models
Max Output
8K
tokens
20% of context
Context Window Comparison
9.0M
1.0K
8-16 GB
RTX 4070 / M2 Pro