Qwen2.5-Coder-7B-Instruct is a 7B-parameter instruction-tuned language model optimized for code-related tasks such as code generation, code reasoning, and bug fixing. Built on the Qwen2.5 architecture, it uses RoPE, SwiGLU, RMSNorm, and grouped-query attention (GQA), and supports context lengths of up to 128K tokens via YaRN-based extrapolation. It is trained on a large corpus of source code, synthetic data, and text-code grounding data, giving it robust performance across programming languages and in agentic coding workflows. The model is part of the Qwen2.5-Coder family, works with inference engines such as vLLM for efficient deployment, and is released under the Apache 2.0 license.
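For the vLLM deployment mentioned above, a minimal sketch of serving the model through vLLM's OpenAI-compatible server (assumes vLLM is installed and the flags shown fit your hardware; the native window is roughly 32K tokens, with 128K requiring the YaRN rope-scaling configuration described in the Qwen documentation):

```shell
# Minimal sketch: expose Qwen2.5-Coder-7B-Instruct behind an
# OpenAI-compatible HTTP endpoint using vLLM's built-in server.
# --max-model-len caps the context to the native ~32K window;
# lower it if you run out of GPU memory.
vllm serve Qwen/Qwen2.5-Coder-7B-Instruct \
  --max-model-len 32768 \
  --dtype auto
```

Once the server is up, any OpenAI-compatible client can target it by pointing its base URL at the local endpoint.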
Quality Index: 10.0 (ranked 364th of 442 models)
Price (blended): $0.00 per 1M tokens (1st cheapest; 100% below the median)
Speed: 0 tok/s
TTFT: 0.00s
Context window: 33K tokens (291st largest)
Input price: $0.00 per 1M tokens
Output price: $0.00 per 1M tokens
Blended price: $0.00 per 1M tokens
Cheaper than 73% of models. Median price is $0.31 per 1M tokens.
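To make the per-1M-token pricing concrete, here is a small sketch (the workload numbers are hypothetical) of how a blended price translates into a monthly bill, using the $0.31/1M median quoted above for comparison:

```python
def monthly_cost(input_tokens, output_tokens, input_price, output_price):
    """Cost in dollars for a month of usage; prices are in $ per 1M tokens."""
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000

# Hypothetical workload: 50M input tokens and 10M output tokens per month.
free_model = monthly_cost(50_000_000, 10_000_000, 0.00, 0.00)
median_model = monthly_cost(50_000_000, 10_000_000, 0.31, 0.31)

print(free_model)    # 0.0
print(median_model)  # 18.6
```

At a $0.00 listed price the monthly cost is zero regardless of volume, while the same workload at the median price would run about $18.60.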
Daily cost: $0.00
Monthly cost: $0.00
Market median speed: 46 tok/s
Median TTFT: 0.42s
Context window: 33K tokens (larger than 9% of models)
Recommended hardware: 8-16 GB of memory (e.g., RTX 4070 or Apple M2 Pro)
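The 8-16 GB hardware figure follows from simple parameter-count arithmetic; a rule-of-thumb sketch (weight memory only, ignoring activation and KV-cache overhead, which is why quantized deployments still want headroom):

```python
PARAMS = 7_000_000_000  # 7B parameters

def weight_memory_gb(params, bits_per_param):
    """Approximate weight-only memory footprint in GB (1 GB = 2**30 bytes)."""
    return params * bits_per_param / 8 / 2**30

print(round(weight_memory_gb(PARAMS, 16), 1))  # fp16 weights: ~13.0 GB
print(round(weight_memory_gb(PARAMS, 4), 1))   # 4-bit quantized: ~3.3 GB
```

Full-precision (fp16) weights alone land near the top of the 8-16 GB range, while 4-bit quantization brings the weights well under 8 GB, which is how the model fits consumer GPUs like the RTX 4070.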