The Qwen3.5 122B-A10B native vision-language model is built on a hybrid architecture that combines a linear attention mechanism with a sparse mixture-of-experts (MoE) design, improving inference efficiency. In overall performance it is second only to Qwen3.5-397B-A17B; its text capabilities significantly outperform Qwen3-235B-2507, and its visual capabilities surpass Qwen3-VL-235B.
Quality Index: 41.6 (36th of 442, Top 8%)
Coding Index: 34.7 (50th of 352, Top 14%)
Price (blended): $1.10 per 1M tokens (492nd cheapest, 255% above median, Top 73%)
Speed: 156 tok/s (Top 12%)
TTFT: 0.98 s
Context Window: 262K tokens (61st largest, Top 25%)
Pricing (per 1M tokens):
  Input: $0.40
  Output: $3.20
  Blended: $1.10
Cheaper than 27% of models; the median price is $0.31/1M tokens.
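The blended figure is consistent with a 3:1 input:output token weighting, a common convention for blended pricing; the exact weighting used here is an assumption. A minimal sketch:

```python
def blended_price(input_price, output_price, input_weight=3, output_weight=1):
    """Weighted average of input/output prices per 1M tokens.

    The 3:1 input:output weighting is an assumption; blended-price
    conventions vary between benchmark sites.
    """
    total = input_weight + output_weight
    return (input_price * input_weight + output_price * output_weight) / total

# (3 * 0.40 + 1 * 3.20) / 4 = 4.40 / 4 = 1.10
print(blended_price(0.40, 3.20))  # 1.1
```

With these weights the quoted $0.40 input and $3.20 output prices reproduce the $1.10 blended price exactly.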
Estimated cost: $1.10 daily / $33.00 monthly.
Throughput: 156 tokens/sec (faster than 88% of models)
TTFT: 0.98 seconds (faster than 30% of models)
Total response time: 13.80 seconds (faster than 19% of models)
Speed Comparison:
  Market median throughput: 46 tok/s (this model is 242% faster)
  Median TTFT: 0.42 s (this model is 134% slower)
  Throughput/Dollar: 142 tok/s per $/1M
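The throughput-per-dollar figure follows directly from the numbers above, as a quick sketch shows (the inputs are this model's reported throughput and blended price):

```python
# Throughput per dollar: output speed divided by blended price.
throughput_tok_s = 156      # tokens/sec, as reported above
blended_price_usd = 1.10    # $ per 1M tokens, as reported above

tok_s_per_dollar = throughput_tok_s / blended_price_usd
print(round(tok_s_per_dollar))  # 142
```

The percentage comparisons against the market medians are computed the same way, though they depend on the unrounded median values, so recomputing them from the rounded figures shown here can be off by a few points.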
Context Window: 262K tokens (larger than 75% of models)
Max Output: 66K tokens (25% of context)
Deployment: Multi-GPU (8x A100 / H100)
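The 8-GPU requirement is consistent with a back-of-envelope weight-memory estimate for a 122B-parameter model. The bytes-per-parameter figures below are standard dtype sizes; KV cache, activations, and runtime overhead are excluded, so this is a rough sketch, not a vendor-published requirement:

```python
# Rough weight-memory estimate for a 122B-parameter model.
# Only the weights are counted; KV cache and activations need extra headroom.
params_billions = 122  # from the model name
bytes_per_param = {"bf16": 2, "fp8": 1, "int4": 0.5}

for dtype, nbytes in bytes_per_param.items():
    print(f"{dtype}: ~{params_billions * nbytes:.0f} GB of weights")
```

An 8x A100/H100 80 GB node provides 640 GB of HBM, which fits the ~244 GB of BF16 weights with room left for KV cache and activations, whereas a single 80 GB GPU cannot hold the full model even at 4-bit precision plus overhead.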