About

DeepSeek R1 Distill Llama 70B is a distilled large language model based on [Llama-3.3-70B-Instruct](/meta-llama/llama-3.3-70b-instruct), fine-tuned on outputs from [DeepSeek R1](/deepseek/deepseek-r1). It reports strong results across multiple benchmarks, including:

- AIME 2024 pass@1: 70.0
- MATH-500 pass@1: 94.5
- CodeForces Rating: 1633

Fine-tuning on DeepSeek R1's outputs lets the model reach performance competitive with larger frontier models.

Model Family

- DeepSeek V3.2 (Reasoning), 2025-12-01
- DeepSeek V3.2 (Non-reasoning), 2025-12-01
- DeepSeek V3.2 Speciale, 2025-12-01
- DeepSeek V3.2 Exp (Reasoning), 2025-09-29
- DeepSeek V3.2 Exp (Non-reasoning), 2025-09-29
- DeepSeek: DeepSeek V3.2 Exp, 2025-09-29
- DeepSeek V3.1 Terminus (Reasoning), 2025-09-22
- DeepSeek V3.1 Terminus (Non-reasoning), 2025-09-22

Benchmarks

- MMLU-Pro: 79.5%
- GPQA Diamond: 40.2%
- HLE: 6.1%
- LiveCodeBench: 26.6%
- SciCode: 31.2%
- TerminalBench Hard: 1.5%
- MATH-500: 93.5%
- AIME: 67.0%
- AIME 2025: 53.7%
- IFBench: 27.6%
- Long Context Recall: 11.0%
- Tau2: 21.9%
