Loading...
Loading...
The Qwen3.5 native vision-language Flash models are built on a hybrid architecture that integrates a linear attention mechanism with a sparse mixture-of-experts model, achieving higher inference efficiency. Compared to the 3 series, these models deliver a leap forward in performance for both pure text and multimodal tasks, offering fast response times while balancing inference speed and overall performance.
Preço/1M
$0.11
239th mais barato
63% abaixo da mediana
Top 35%
Janela de Contexto
1.0M
28th maior
Top 11%
Entrada
$0.07
por 1M tokens
Saída
$0.26
por 1M tokens
Combinado
$0.11
por 1M tokens
Mais barato que 65% dos modelos. Preço mediano é $0.31/1M tokens.
Diário
$0.11
Mensal
$3.41
Janela de Contexto
1.0M
tokens
Maior que 89% dos modelos
Saída Máxima
66K
tokens
7% do contexto
Comparação de Janela de Contexto