Qwen3-Coder-30B-A3B-Instruct is a 30.5B-parameter Mixture-of-Experts (MoE) model with 128 experts (8 active per forward pass), designed for advanced code generation, repository-scale understanding, and agentic tool use. Built on the Qwen3 architecture, it supports a native context length of 256K tokens (extendable to 1M with YaRN) and performs strongly on tasks involving function calling, browser use, and structured code completion. The model is optimized for instruction following without a "thinking" mode and integrates well with OpenAI-compatible tool-use formats.
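To illustrate the OpenAI-compatible tool-use format mentioned above, here is a minimal sketch that builds a chat-completion request payload with one function tool. The tool name (`run_tests`), its schema, and the exact model ID string are assumptions for illustration; substitute your provider's values before sending the request.

```python
import json

def build_tool_request(user_prompt: str) -> dict:
    """Build an OpenAI-compatible chat request with one function tool."""
    tools = [{
        "type": "function",
        "function": {
            "name": "run_tests",  # hypothetical tool name, for illustration
            "description": "Run the project's test suite and return results.",
            "parameters": {
                "type": "object",
                "properties": {
                    "path": {"type": "string", "description": "Test directory"},
                },
                "required": ["path"],
            },
        },
    }]
    return {
        "model": "Qwen3-Coder-30B-A3B-Instruct",  # assumed model ID
        "messages": [{"role": "user", "content": user_prompt}],
        "tools": tools,
        "tool_choice": "auto",  # let the model decide when to call the tool
    }

payload = build_tool_request("Fix the failing tests under tests/")
print(json.dumps(payload, indent=2))
```

The payload is what you would POST to a provider's `/v1/chat/completions` endpoint; the model then either answers directly or returns a `tool_calls` entry with arguments matching the schema.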
Quality Index: 20.0 (186th of 442, Top 43%)
Coding Index: 19.4 (152nd of 352, Top 43%)
Math Index: 29.0 (187th of 268, Top 70%)
Price (blended): $0.90 per 1M tokens (473rd cheapest, 190% above median, Top 70%)
Speed: 26 tok/s (Top 60%)
TTFT: 1.44 s
Context Window: 160K tokens (144th largest, Top 41%)
Input: $0.45 per 1M tokens
Output: $2.25 per 1M tokens
Blended: $0.90 per 1M tokens
Cheaper than 30% of models; the median price is $0.31 per 1M tokens.
Estimated cost: $0.90 daily, $27.00 monthly.
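The blended price above can be reproduced from the listed input and output prices. A minimal sketch, assuming the common 3:1 input-to-output token ratio (the page does not state which blend ratio it uses):

```python
INPUT_PRICE = 0.45    # $ per 1M input tokens, from the listing
OUTPUT_PRICE = 2.25   # $ per 1M output tokens, from the listing

def blended_price(input_ratio: float = 3.0, output_ratio: float = 1.0) -> float:
    """Weighted average price per 1M tokens for a given input:output mix."""
    total = input_ratio + output_ratio
    return (INPUT_PRICE * input_ratio + OUTPUT_PRICE * output_ratio) / total

print(f"${blended_price():.2f} per 1M tokens")  # → $0.90, matching the listing
```

The 3:1 assumption fits exactly here: (3 × $0.45 + 1 × $2.25) / 4 = $0.90.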
Throughput: 26 tokens/sec (faster than 40% of models)
Time to first token (TTFT): 1.44 seconds (faster than 18% of models)
Market median throughput: 46 tok/s (this model is 42% slower)
Median TTFT: 0.42 s (this model is 243% slower)
Throughput per dollar: 29 tok/s per $/1M
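The throughput-per-dollar figure follows directly from the listed numbers. A quick check:

```python
throughput = 26.0   # tokens/sec, from the listing
blended = 0.90      # $ per 1M tokens, from the listing

# Throughput normalized by blended price: tok/s per $/1M.
tok_per_s_per_dollar = throughput / blended
print(round(tok_per_s_per_dollar))  # → 29, matching the listed value
```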
Context Window: 160K tokens (larger than 59% of models); extendable to 1.0M with YaRN
Max Output: 33K tokens (about 20% of the context window)
Estimated local hardware: 24-48 GB VRAM (e.g., A6000 / M3 Ultra)
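Given the listed 160K context window and 33K max output, the input budget for a request can be sketched as below. Token counts are approximate and the real limits depend on the serving provider; the function name is illustrative.

```python
CONTEXT_WINDOW = 160_000  # tokens, from the listing
MAX_OUTPUT = 33_000       # tokens, from the listing

def max_input_tokens(reserved_output: int = MAX_OUTPUT) -> int:
    """Largest prompt that still leaves room for the completion in the window."""
    if reserved_output > CONTEXT_WINDOW:
        raise ValueError("reserved output exceeds the context window")
    return CONTEXT_WINDOW - reserved_output

print(max_input_tokens())  # → 127000
```

In practice you would reserve only the output length you actually request, not the full 33K maximum, to leave more room for the prompt.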