DeepSeek-V3.1 Terminus is an update to [DeepSeek V3.1](/deepseek/deepseek-chat-v3.1) that maintains the model's original capabilities while addressing user-reported issues with language consistency and agent behaviour, further improving performance in coding and search agents. It is a large hybrid reasoning model (671B parameters, 37B active) that supports both thinking and non-thinking modes. It extends the DeepSeek-V3 base with a two-phase long-context training process, reaching up to 128K tokens, and uses FP8 microscaling for efficient inference. Reasoning behaviour can be controlled with the `reasoning` `enabled` boolean; [learn more in our docs](https://openrouter.ai/docs/use-cases/reasoning-tokens#enable-reasoning-with-default-config). The model improves tool use, code generation, and reasoning efficiency, achieving performance comparable to DeepSeek-R1 on difficult benchmarks while responding more quickly. It supports structured tool calling, code agents, and search agents, making it suitable for research, coding, and agentic workflows.
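As a sketch of how the `reasoning` `enabled` boolean might be passed, the request body below follows the OpenRouter chat completions format; the model slug `deepseek/deepseek-v3.1-terminus` and the endpoint shown in the comment are assumptions, not confirmed by this page.

```python
import json

# Hypothetical request body for an OpenRouter-style chat completions call.
# The model slug "deepseek/deepseek-v3.1-terminus" is an assumption.
payload = {
    "model": "deepseek/deepseek-v3.1-terminus",
    "messages": [
        {"role": "user", "content": "Summarize FP8 microscaling in two sentences."}
    ],
    # Toggle thinking mode on or off via the `reasoning` `enabled` boolean.
    "reasoning": {"enabled": True},
}

body = json.dumps(payload)
# Send with any HTTP client, e.g.:
#   POST https://openrouter.ai/api/v1/chat/completions
#   Authorization: Bearer <OPENROUTER_API_KEY>
```

Setting `"enabled": False` instead requests the non-thinking mode of the same hybrid model.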
| Metric | Value | Rank | Percentile |
|---|---|---|---|
| Quality Index | 28.5 | 110th of 442 | Top 25% |
| Coding Index | 31.9 | 65th of 352 | Top 19% |
| Math Index | 53.7 | 132nd of 268 | Top 50% |
| Price/1M | $0.63 | 409th cheapest (102% above median) | Top 60% |
| Speed | 0 tok/s | | |
| TTFT | 0.00s | | |
| Context Window | 164K | 135th largest | Top 41% |
Pricing (per 1M tokens):

| Type | Price |
|---|---|
| Input | $0.34 |
| Output | $1.50 |
| Blended | $0.63 |

Cheaper than 40% of models. Median price is $0.31/1M tokens.
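The blended figure is consistent with a 3:1 input-to-output token weighting of the two listed rates; the quick check below makes that explicit (the 3:1 mix is an assumption, not stated on this page).

```python
input_price = 0.34   # $ per 1M input tokens
output_price = 1.50  # $ per 1M output tokens

# Assumed 3:1 input:output token mix behind the blended rate.
blended = (3 * input_price + 1 * output_price) / 4
print(f"${blended:.2f}")  # -> $0.63
```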
Estimated cost:

| Period | Cost |
|---|---|
| Daily | $0.63 |
| Monthly | $18.78 |
Speed:

| Metric | Value | Comparison |
|---|---|---|
| Throughput | 0 tok/s | Faster than 0% of models; market median 46 tok/s (100% slower) |
| TTFT | 0.00s | Faster than 61% of models; median TTFT 0.42s (100% faster) |
| Throughput/Dollar | 0 tok/s per $/1M | |
Context:

| Metric | Value | Comparison |
|---|---|---|
| Context Window | 164K tokens | Larger than 59% of models |
| Max Output | 66K tokens | 40% of context |
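One practical consequence of these two limits: a request's prompt plus its requested output must fit in the 164K context, and the output alone is capped at 66K. A minimal budget check under those numbers (the helper name and the exact token counts used in the examples are hypothetical):

```python
CONTEXT_WINDOW = 164_000  # total context, tokens (~164K)
MAX_OUTPUT = 66_000       # max output tokens (~66K, ~40% of context)

def fits_budget(prompt_tokens: int, requested_output: int) -> bool:
    """Return True if the request fits both the context and output caps."""
    return (requested_output <= MAX_OUTPUT
            and prompt_tokens + requested_output <= CONTEXT_WINDOW)

print(fits_budget(100_000, 60_000))  # 160K total fits -> True
print(fits_budget(120_000, 60_000))  # 180K exceeds context -> False
```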