Loading...
Loading...
Qwen3, the latest generation in the Qwen large language model series, features both dense and mixture-of-experts (MoE) architectures to excel in reasoning, multilingual support, and advanced agent tasks. Its unique ability to switch seamlessly between a thinking mode for complex reasoning and a non-thinking mode for efficient dialogue ensures versatile, high-quality performance. Significantly outperforming prior models like QwQ and Qwen2.5, Qwen3 delivers superior mathematics, coding, commonsense reasoning, creative writing, and interactive dialogue capabilities. The Qwen3-30B-A3B variant includes 30.5 billion parameters (3.3 billion activated), 48 layers, 128 experts (8 activated per task), and supports up to 131K token contexts with YaRN, setting a new standard among open-source models.
Price/1M
$0.13
247th cheapest
58% below median
Top 37%
Context Window
41K
285th largest
Top 82%
Input
$0.08
per 1M tokens
Output
$0.28
per 1M tokens
Blended
$0.13
per 1M tokens
Cheaper than 63% of models. Median price is $0.31/1M tokens.
Daily
$0.13
Monthly
$3.90
Context Window
41K
tokens
Larger than 18% of models
Max Output
41K
tokens
100% of context
Context Window Comparison
1.3M
868
24-48 GB
A6000 / M3 Ultra