About

DeepSeek R1 Distill Llama 70B is a distilled large language model based on [Llama-3.3-70B-Instruct](/meta-llama/llama-3.3-70b-instruct), using outputs from [DeepSeek R1](/deepseek/deepseek-r1). The model combines advanced distillation techniques to achieve high performance across multiple benchmarks, including: - AIME 2024 pass@1: 70.0 - MATH-500 pass@1: 94.5 - CodeForces Rating: 1633 The model leverages fine-tuning from DeepSeek R1's outputs, enabling competitive performance comparable to larger frontier models.

Model Family

DeepSeek V3.2 (Reasoning)2025-12-01 DeepSeek V3.2 (Non-reasoning)2025-12-01 DeepSeek V3.2 Speciale2025-12-01 DeepSeek V3.2 Exp (Reasoning)2025-09-29 DeepSeek V3.2 Exp (Non-reasoning)2025-09-29 DeepSeek: DeepSeek V3.2 Exp2025-09-29 DeepSeek V3.1 Terminus (Reasoning)2025-09-22 DeepSeek V3.1 Terminus (Non-reasoning)2025-09-22

Benchmarks

MMLU-Pro

79.5%

GPQA Diamond

40.2%

HLE

6.1%

LiveCodeBench

26.6%

SciCode

31.2%

TerminalBench Hard

1.5%

MATH-500

93.5%

AIME

67.0%

AIME 2025

53.7%

IFBench

27.6%

Long Context Recall

11.0%

Tau2

21.9%

Market AverageTop Score

DeepSeek R1 Distill Llama 70B

About

Model Family

Market Position

Pricing

Cost Calculator

vs. Similar Models

Performance

Benchmarks

Open Source

Quick Compare

Similar Models