Loading...
Loading...
QwQ is the reasoning model of the Qwen series. Compared with conventional instruction-tuned models, QwQ, which is capable of thinking and reasoning, can achieve significantly enhanced performance in downstream tasks, especially hard problems. QwQ-32B is the medium-sized reasoning model, which is capable of achieving competitive performance against state-of-the-art reasoning models, e.g., DeepSeek-R1, o1-mini.
Quality Index
19.7
190th of 442
Top 43%
Math Index
29.0
187th of 268
Top 70%
Price/1M
$0.74
426th cheapest
140% above median
Top 63%
Speed
32 tok/s
Top 59%
TTFT
0.41s
Context Window
131K
145th largest
Top 63%
Input
$0.66
per 1M tokens
Output
$1.00
per 1M tokens
Blended
$0.74
per 1M tokens
Cheaper than 37% of models. Median price is $0.31/1M tokens.
Daily
$0.74
Monthly
$22.35
32
tokens/sec
Faster than 41% of models
0.41
seconds
Faster than 51% of models
77.18
seconds
Faster than 1% of models
Market Median
46 tok/s
29% slower
Median TTFT
0.42s
2% faster
Throughput/Dollar
44
tok/s per $/1M
Speed Comparison
Context Window
131K
tokens
Larger than 37% of models
Max Output
131K
tokens
100% of context