Loading...
Loading...
The Qwen3.5 native vision-language Flash models are built on a hybrid architecture that integrates a linear attention mechanism with a sparse mixture-of-experts model, achieving higher inference efficiency. Compared to the 3 series, these models deliver a leap forward in performance for both pure text and multimodal tasks, offering fast response times while balancing inference speed and overall performance.
Price/1M
$0.11
239th cheapest
63% below median
Top 35%
Context Window
1.0M
28th largest
Top 11%
Input
$0.07
per 1M tokens
Output
$0.26
per 1M tokens
Blended
$0.11
per 1M tokens
Cheaper than 65% of models. Median price is $0.31/1M tokens.
Daily
$0.11
Monthly
$3.41
Context Window
1.0M
tokens
Larger than 89% of models
Max Output
66K
tokens
7% of context
Context Window Comparison