About

Qwen3-Next-80B-A3B-Instruct is an instruction-tuned chat model in the Qwen3-Next series optimized for fast, stable responses without “thinking” traces. It targets complex tasks across reasoning, code generation, knowledge QA, and multilingual use, while remaining robust on alignment and formatting. Compared with prior Qwen3 instruct variants, it focuses on higher throughput and stability on ultra-long inputs and multi-turn dialogues, making it well-suited for RAG, tool use, and agentic workflows that require consistent final answers rather than visible chain-of-thought. The model employs scaling-efficient training and decoding to improve parameter efficiency and inference speed, and has been validated on a broad set of public benchmarks where it reaches or approaches larger Qwen3 systems in several categories while outperforming earlier mid-sized baselines. It is best used as a general assistant, code helper, and long-context task solver in production settings where deterministic, instruction-following outputs are preferred.

Model Family

Qwen3.5 9B (Reasoning), 2026-03-02
Qwen3.5 9B (Non-reasoning), 2026-03-02
Qwen3.5 4B (Reasoning), 2026-03-02
Qwen3.5 4B (Non-reasoning), 2026-03-02
Qwen3.5 2B (Reasoning), 2026-03-02
Qwen3.5 2B (Non-reasoning), 2026-03-02
Qwen3.5 0.8B (Reasoning), 2026-03-02
Qwen3.5 0.8B (Non-reasoning), 2026-03-02

Benchmarks

MMLU-Pro: 81.9%
GPQA Diamond: 73.8%
HLE: 7.3%
LiveCodeBench: 68.4%
SciCode: 30.7%
TerminalBench Hard: 7.6%
MATH-500: Not evaluated
AIME: Not evaluated
AIME 2025: 66.3%
IFBench: 39.7%
Long Context Recall: 51.3%
Tau2: 21.6%

Open Source

HuggingFace

License: apache-2.0
Parameters: 80B
Formats: GGUF / GPTQ / AWQ
Downloads (30d): 885.2K
Likes: 950
VRAM (FP16): Multi-GPU
GPU: 8x A100 / H100
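
The multi-GPU requirement follows from simple arithmetic on the figures listed here: 80B parameters at 2 bytes each in FP16 come to about 160 GB of weights, more than any single 80 GB card holds. The sketch below works that out. The function names, the 80 GB per-GPU capacity, and the `usable_fraction` headroom factor are illustrative assumptions, not values from this page, and the estimate covers weights only. KV cache and activations for long contexts add to it, which may be part of why the listed configuration is larger than this floor.

```python
import math


def fp16_weights_gb(num_params: float, bytes_per_param: int = 2) -> float:
    """Weight memory in decimal GB; FP16 stores each parameter in 2 bytes."""
    return num_params * bytes_per_param / 1e9


def min_gpus_for_weights(weights_gb: float,
                         gpu_mem_gb: float = 80.0,      # assumed per-GPU capacity
                         usable_fraction: float = 0.9   # assumed headroom for runtime overhead
                         ) -> int:
    """Smallest GPU count whose usable memory holds the weights alone."""
    return math.ceil(weights_gb / (gpu_mem_gb * usable_fraction))


if __name__ == "__main__":
    weights = fp16_weights_gb(80e9)  # 80B parameters, as listed above
    print(f"FP16 weights: {weights:.0f} GB")
    print(f"GPUs needed for weights only: {min_gpus_for_weights(weights)}")
```

This is a lower bound for fitting the weights, not a serving recommendation. Quantized formats such as GPTQ or AWQ use fewer bytes per parameter and can be approximated by changing `bytes_per_param`.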
