About

The Qwen3.5 series 397B-A17B native vision-language model is built on a hybrid architecture that integrates a linear attention mechanism with a sparse mixture-of-experts model, achieving higher inference efficiency. It delivers state-of-the-art performance comparable to leading-edge models across a wide range of tasks, including language understanding, logical reasoning, code generation, agent-based tasks, image understanding, video understanding, and graphical user interface (GUI) interactions. With its robust code-generation and agent capabilities, the model exhibits strong generalization across diverse agent.

Model Family

Qwen3.5 9B (Reasoning)2026-03-02 Qwen3.5 9B (Non-reasoning)2026-03-02 Qwen3.5 4B (Reasoning)2026-03-02 Qwen3.5 4B (Non-reasoning)2026-03-02 Qwen3.5 2B (Reasoning)2026-03-02 Qwen3.5 2B (Non-reasoning)2026-03-02 Qwen3.5 0.8B (Reasoning)2026-03-02 Qwen3.5 0.8B (Non-reasoning)2026-03-02

Benchmarks

MMLU-ProNot evaluated

GPQA Diamond

89.3%

HLE

27.3%

LiveCodeBenchNot evaluated

SciCode

42.0%

TerminalBench Hard

40.9%

MATH-500Not evaluated

AIMENot evaluated

AIME 2025Not evaluated

IFBench

78.8%

Long Context Recall

65.7%

Tau2

95.6%

Market AverageTop Score

Open Source

HuggingFace

apache-2.0397BGGUF / GPTQ / AWQ

Downloads (30d)

1.8M

Likes

1.4K

VRAM (FP16)

Multi-GPU

GPU

8x A100 / H100

Qwen3.5 397B A17B (Reasoning)

About

Model Family

Market Position

Pricing

Cost Calculator

vs. Similar Models

Performance

Benchmarks

Open Source

Quick Compare

Similar Models