Loading...
Loading...
Qwen3-8B is a dense 8.2B parameter causal language model from the Qwen3 series, designed for both reasoning-heavy tasks and efficient dialogue. It supports seamless switching between "thinking" mode for math, coding, and logical inference, and "non-thinking" mode for general conversation. The model is fine-tuned for instruction-following, agent integration, creative writing, and multilingual use across 100+ languages and dialects. It natively supports a 32K token context window and can extend to 131K tokens with YaRN scaling.
Preço/1M
$0.14
252nd mais barato
56% abaixo da mediana
Top 37%
Janela de Contexto
41K
285th maior
Top 82%
Entrada
$0.05
por 1M tokens
Saída
$0.40
por 1M tokens
Combinado
$0.14
por 1M tokens
Mais barato que 63% dos modelos. Preço mediano é $0.31/1M tokens.
Diário
$0.14
Mensal
$4.12
Janela de Contexto
41K
tokens
Maior que 18% dos modelos
Saída Máxima
8K
tokens
20% do contexto
Comparação de Janela de Contexto
9.0M
1.0K
8-16 GB
RTX 4070 / M2 Pro