Loading...
Loading...
Qwen2.5-Coder-7B-Instruct is a 7B parameter instruction-tuned language model optimized for code-related tasks such as code generation, reasoning, and bug fixing. Based on the Qwen2.5 architecture, it incorporates enhancements like RoPE, SwiGLU, RMSNorm, and GQA attention with support for up to 128K tokens using YaRN-based extrapolation. It is trained on a large corpus of source code, synthetic data, and text-code grounding, providing robust performance across programming languages and agentic coding workflows. This model is part of the Qwen2.5-Coder family and offers strong compatibility with tools like vLLM for efficient deployment. Released under the Apache 2.0 license.
Índice de Qualidade
10.0
364th de 442
Top 83%
Preço/1M
$0.00
1st mais barato
100% abaixo da mediana
Top 27%
Velocidade
0 tok/s
TTFT
0.00s
Janela de Contexto
33K
291st maior
Top 91%
Entrada
$0.00
por 1M tokens
Saída
$0.00
por 1M tokens
Combinado
$0.00
por 1M tokens
Mais barato que 73% dos modelos. Preço mediano é $0.31/1M tokens.
Diário
$0.00
Mensal
$0.00
0
tokens/seg
Mais rápido que 0% dos modelos
0.00
segundos
Mais rápido que 61% dos modelos
0.00
segundos
Mais rápido que 61% dos modelos
Mediana do Mercado
46 tok/s
100% mais lento
TTFT Mediano
0.42s
100% mais rápido
Comparação de Velocidade
Janela de Contexto
33K
tokens
Maior que 9% dos modelos
2.5M
669
8-16 GB
RTX 4070 / M2 Pro