DeepSeek-V3.1 Terminus is an update to [DeepSeek V3.1](/deepseek/deepseek-chat-v3.1) that maintains the model's original capabilities while addressing user-reported issues, including language consistency and agent behavior, and further optimizes performance in coding and search agents. It is a large hybrid reasoning model (671B parameters, 37B active) that supports both thinking and non-thinking modes. It extends the DeepSeek-V3 base with a two-phase long-context training process reaching up to 128K tokens, and uses FP8 microscaling for efficient inference. Users can control the reasoning behaviour with the `reasoning` `enabled` boolean. [Learn more in our docs](https://openrouter.ai/docs/use-cases/reasoning-tokens#enable-reasoning-with-default-config). The model improves tool use, code generation, and reasoning efficiency, achieving performance comparable to DeepSeek-R1 on difficult benchmarks while responding more quickly. It supports structured tool calling, code agents, and search agents, making it suitable for research, coding, and agentic workflows.
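As a sketch of how the thinking/non-thinking toggle might be used, the request below builds an OpenRouter-style chat-completions payload with the `reasoning` `enabled` boolean. The model slug and endpoint shown are assumptions based on OpenRouter conventions, not details stated on this page.

```python
import json

# Minimal sketch: toggle thinking mode via the `reasoning.enabled` boolean.
# The model slug below is an assumption; check the model page for the exact ID.
payload = {
    "model": "deepseek/deepseek-v3.1-terminus",
    "messages": [
        {"role": "user", "content": "Briefly explain FP8 microscaling."}
    ],
    # Set to False to use the faster non-thinking mode.
    "reasoning": {"enabled": True},
}

body = json.dumps(payload)
# Send with any HTTP client, e.g.:
# requests.post("https://openrouter.ai/api/v1/chat/completions",
#               headers={"Authorization": f"Bearer {API_KEY}"},
#               data=body)
print(body)
```

Omitting the `reasoning` object falls back to the provider's default behaviour, per the docs linked above.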
| Metric | Value | Rank |
| --- | --- | --- |
| Quality Index | 28.5 | 110th of 442 (Top 25%) |
| Coding Index | 31.9 | 65th of 352 (Top 19%) |
| Math Index | 53.7 | 132nd of 268 (Top 50%) |
| Price (blended, per 1M tokens) | $0.63 | 409th cheapest, 102% above median (Top 60%) |
| Speed | 0 tok/s | n/a |
| TTFT | 0.00s | n/a |
| Context Window | 164K | 135th largest (Top 41%) |
**Pricing**

| Type | Price per 1M tokens |
| --- | --- |
| Input | $0.34 |
| Output | $1.50 |
| Blended | $0.63 |

Cheaper than 40% of models; the median price is $0.31/1M tokens.

Daily: $0.63 · Monthly: $18.78
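The blended figure is consistent with a 3:1 input:output token weighting, a common convention for blended pricing; the ratio is an assumption, since the page does not state it.

```python
# Blended $/1M tokens assuming a 3:1 input:output token mix (assumption:
# the page does not state its weighting, but 3:1 reproduces the figure).
input_price = 0.34   # $ per 1M input tokens
output_price = 1.50  # $ per 1M output tokens
blended = (3 * input_price + 1 * output_price) / 4
print(f"${blended:.2f}/1M tokens")  # → $0.63/1M tokens
```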
**Speed**

- Throughput: 0 tokens/sec (faster than 0% of models)
- Time to first token (TTFT): 0.00 seconds (faster than 61% of models)
- Market median throughput: 46 tok/s (this model is 100% slower)
- Median TTFT: 0.42s (this model is 100% faster)
- Throughput per dollar: 0 tok/s per $/1M
**Context Window**

- Context window: 164K tokens (larger than 59% of models)
- Max output: 66K tokens (40% of the context window)