About

Qwen3.5 35B-A3B is a native vision-language model with a hybrid architecture that combines linear attention with a sparse mixture-of-experts design to improve inference efficiency. Its overall performance is comparable to that of Qwen3.5-27B.

Model Family

Qwen3.5 9B (Reasoning), 2026-03-02
Qwen3.5 9B (Non-reasoning), 2026-03-02
Qwen3.5 4B (Reasoning), 2026-03-02
Qwen3.5 4B (Non-reasoning), 2026-03-02
Qwen3.5 2B (Reasoning), 2026-03-02
Qwen3.5 2B (Non-reasoning), 2026-03-02
Qwen3.5 0.8B (Reasoning), 2026-03-02
Qwen3.5 0.8B (Non-reasoning), 2026-03-02

Benchmarks

MMLU-Pro: Not evaluated
GPQA Diamond: 84.5%
HLE: 19.7%
LiveCodeBench: Not evaluated
SciCode: 37.7%
TerminalBench Hard: 26.5%
MATH-500: Not evaluated
AIME: Not evaluated
AIME 2025: Not evaluated
IFBench: 72.5%
Long Context Recall: 62.7%
Tau2: 89.2%

Open Source

HuggingFace

apache-2.0 | 35B | GGUF / GPTQ / AWQ
Downloads (30d): 2.4M
Likes: 1.2K
VRAM (FP16): 48-80 GB
GPU: A100 80GB
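
As a rough sanity check on the VRAM (FP16) figure, the memory needed for the weights alone can be estimated as parameter count times bytes per parameter. The sketch below is illustrative only: the function name and the precision table are assumptions made here, not something the listing provides, and the estimate ignores activations, KV cache, and framework overhead.

```python
def weight_memory_gb(num_params: float, bytes_per_param: float) -> float:
    """Approximate memory for model weights only, in decimal gigabytes."""
    return num_params * bytes_per_param / 1e9


# Total parameter count, taken from the "35B" in the listing.
TOTAL_PARAMS = 35e9

# Bytes per parameter for a few common storage precisions (assumed values;
# real quantized formats such as GGUF, GPTQ, or AWQ add some overhead).
PRECISIONS = {"fp16": 2.0, "int8": 1.0, "int4": 0.5}

estimates = {
    name: weight_memory_gb(TOTAL_PARAMS, nbytes)
    for name, nbytes in PRECISIONS.items()
}

for name, gb in estimates.items():
    print(f"{name}: ~{gb:.1f} GB for weights")
```

At FP16 this gives about 70 GB for the weights, which sits inside the listed 48-80 GB range and is consistent with the single A100 80GB suggestion. If the "A3B" suffix denotes roughly 3B parameters active per token, as in earlier Qwen mixture-of-experts naming, that reduces compute per token but not this figure, since all experts still have to be resident in memory.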

Qwen3.5 35B A3B (Reasoning)
