Qwen3-235B-A22B is a 235B parameter mixture-of-experts (MoE) model developed by Qwen, activating 22B parameters per forward pass. It supports seamless switching between a "thinking" mode for complex reasoning, math, and code tasks, and a "non-thinking" mode for general conversational efficiency. The model demonstrates strong reasoning ability, multilingual support (100+ languages and dialects), advanced instruction-following, and agent tool-calling capabilities. It natively handles a 32K token context window and extends up to 131K tokens using YaRN-based scaling.
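The step from the native window to the extended one implies a fixed scaling factor. A minimal sketch of that arithmetic follows, assuming "32K" and "131K" stand for 32,768 and 131,072 tokens; the page gives only the rounded figures, so the exact counts are an assumption.

```python
# Assumed exact sizes; the page only shows the rounded "32K" and "131K".
NATIVE_CONTEXT = 32_768
EXTENDED_CONTEXT = 131_072


def scaling_factor(native: int, extended: int) -> float:
    """Return how many times the native context window is stretched."""
    return extended / native


factor = scaling_factor(NATIVE_CONTEXT, EXTENDED_CONTEXT)
print(f"Context scaling factor: {factor:g}x")
```

Under these assumptions the extended window is an exact multiple of the native one, which is the kind of factor a YaRN-style configuration would typically be given.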
Price per 1M tokens: $0.80
437th cheapest
157% above the median
Top 64%
Context window: 131K tokens
145th largest
Top 63%
Input: $0.45 per 1M tokens
Output: $1.82 per 1M tokens
Combined: $0.80 per 1M tokens
Cheaper than 36% of models. The median price is $0.31 per 1M tokens.
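The page does not say how the combined price is derived from the input and output prices. One weighting that lands close to it is a 3:1 input-to-output token mix; that ratio is an assumption, not something the listing states. With the rounded prices shown here it gives roughly $0.79, slightly under the listed $0.80, so the site may use unrounded prices or a different mix. The sketch also shows how a "percent above the median" figure can be computed.

```python
INPUT_PRICE = 0.45   # USD per 1M input tokens (from the listing)
OUTPUT_PRICE = 1.82  # USD per 1M output tokens (from the listing)
MEDIAN_PRICE = 0.31  # USD per 1M tokens (from the listing)


def blended_price(input_price: float, output_price: float,
                  input_share: float = 0.75) -> float:
    """Weighted average price; input_share=0.75 is an assumed 3:1 mix."""
    return input_share * input_price + (1.0 - input_share) * output_price


def percent_above(value: float, baseline: float) -> float:
    """How far `value` sits above `baseline`, in percent."""
    return (value / baseline - 1.0) * 100.0


blended = blended_price(INPUT_PRICE, OUTPUT_PRICE)
print(f"Blended price (assumed 3:1 mix): ${blended:.2f} per 1M tokens")
print(f"Listed $0.80 vs median: {percent_above(0.80, MEDIAN_PRICE):.0f}% above")
```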
Daily: $0.80
Monthly: $23.89
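The listing does not state the usage behind these two figures. They are consistent with roughly 1M tokens per day at the combined rate over a 30-day month, with the monthly value computed from an unrounded daily cost; both the volume and the month length are assumptions. A small sketch for projecting cost at any volume:

```python
def monthly_cost(price_per_million: float, tokens_per_day: int,
                 days: int = 30) -> float:
    """Project a monthly bill from a per-1M-token price and a daily volume."""
    daily = price_per_million * tokens_per_day / 1_000_000
    return daily * days


# Assumed workload: 1M tokens per day at the listed combined price.
estimate = monthly_cost(0.80, 1_000_000)
print(f"Estimated monthly cost: ${estimate:.2f}")
```

Using the rounded $0.80 gives a slightly higher total than the listed $23.89, which suggests the page works from an unrounded combined price.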
Context window: 131K tokens
Larger than 37% of models
Max output: 8K tokens
6% of context
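The output share can be checked with a line of arithmetic. The sketch below assumes "131K" and "8K" mean 131,072 and 8,192 tokens, and that generated tokens count against the same window as the prompt; neither detail is stated on the page.

```python
# Assumed exact sizes behind the rounded "131K" and "8K" on the page.
CONTEXT_TOKENS = 131_072
MAX_OUTPUT_TOKENS = 8_192

# Share of the window that a maximum-length response would occupy.
output_share = MAX_OUTPUT_TOKENS / CONTEXT_TOKENS * 100

# Prompt budget left if the full output allowance is reserved.
prompt_budget = CONTEXT_TOKENS - MAX_OUTPUT_TOKENS

print(f"Max output is {output_share:.2f}% of the context window")
print(f"Prompt budget with full output reserved: {prompt_budget} tokens")
```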
Context window comparison (chart; labels shown: 740.8K and 1.1K)
Multi-GPU: 8x A100 / H100
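A rough memory estimate shows why a multi-GPU setup is listed. The sketch assumes 80 GB cards, 16-bit weights (2 bytes per parameter), and an even split across GPUs; it counts weights only and ignores the KV cache, activations, and framework overhead. None of these details come from the page. All 235B parameters are counted because every expert has to be resident in memory, even though only 22B are active per forward pass.

```python
TOTAL_PARAMS = 235e9   # total parameters (from the model description)
BYTES_PER_PARAM = 2    # assumed 16-bit weights
GPU_COUNT = 8          # from the listing
GPU_MEMORY_GB = 80     # assumed 80 GB variant of A100 / H100

# Weights only, in decimal gigabytes.
weights_gb = TOTAL_PARAMS * BYTES_PER_PARAM / 1e9
per_gpu_gb = weights_gb / GPU_COUNT
headroom_gb = GPU_MEMORY_GB - per_gpu_gb

print(f"Weights: {weights_gb:.0f} GB total, {per_gpu_gb:.2f} GB per GPU")
print(f"Headroom per GPU for cache and activations: {headroom_gb:.2f} GB")
```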