DeepSeek V3.2

Name: DeepSeek V3.2 (正式版)
Author: DeepSeek-AI

推理大模型

DeepSeek V3.2 (正式版)

Release date: 2025-12-01更新于: 2026-04-08 15:40:09.1183,393

Live demo GitHub Hugging Face Compare

Parameters

6710.0亿

Context length

128K

Chinese support

Supported

Reasoning ability

DeepSeek V3.2 是 DeepSeek 于2026年1月发布的通用旗舰模型，671B 参数，采用 DSA 稀疏注意力机制，结合大规模强化学习与 agent 任务合成训练，性能对标 GPT-5，支持工具调用与思考模式，MIT 协议开源。

Data sourced primarily from official releases (GitHub, Hugging Face, papers), then benchmark leaderboards, then third-party evaluators. Learn about our data methodology

DeepSeek V3.2

Model basics

Reasoning traces

Supported

Thinking modes

Thinking Level · On (Default)Thinking Level · Off

Context length

128K tokens

Max output length

8192 tokens

Model type

推理大模型

Release date

2025-12-01

Model file size

1.34TB

MoE architecture

Yes

Total params / Active params

6710.0B / 370B

Knowledge cutoff

No data

DeepSeek V3.2

Open source & experience

Code license

DEEPSEEK LICENSE AGREEMENT

Weights license

DEEPSEEK LICENSE AGREEMENT- 免费商用授权

GitHub repo

https://github.com/deepseek-ai/DeepSeek-V3.2-Exp

Hugging Face

https://huggingface.co/deepseek-ai/DeepSeek-V3.2

Live demo

https://chat.deepseek.com

DeepSeek V3.2

Official resources

Paper

DeepSeek-V3.2 正式版发布与说明

DataLearnerAI blog

复杂问题推理能力大幅提升，DeepSeekAI发布DeepSeek V3.2正式版本以及一个评测结果可以媲美Gemini 3.0 Pro的将开源模型推到极限性能的DeepSeek-V3.2-Speciale模型

DeepSeek V3.2

API details

API speed

3/5

No public API pricing yet.

DeepSeek V3.2

Benchmark Results

DeepSeek V3.2 currently shows benchmark results led by LiveCodeBench (13 / 110, score 83.30), AIME2025 (30 / 107, score 93.10), GPQA Diamond (53 / 166, score 82.40). This page also consolidates core specs, context limits, and API pricing so you can evaluate the model from benchmark results and deployment constraints together.

综合评估

4 evaluations

Benchmark / mode

Score

Rank/total

GPQA Diamond

82.40

53 / 166

ARC-AGI

28 / 56

HLE

25.10

62 / 129

ARC-AGI-2

39 / 50

编程与软件工程

5 evaluations

Benchmark / mode

Score

Rank/total

CodeForces

2386

7 / 10

LiveCodeBench

83.30

13 / 110

SWE-bench Verified

70.20

44 / 96

SWE-bench Verified

OnTools

73.10

33 / 96

SWE-Bench Pro - Public

40.90

20 / 25

数学推理

2 evaluations

Benchmark / mode

Score

Rank/total

AIME2025

93.10

30 / 107

AIME 2026

92.70

5 / 12

Agent能力评测

2 evaluations

Benchmark / mode

Score

Rank/total

τ²-Bench

OnTools

80.30

14 / 40

Aider-Polyglot

OnTools

69.90

12 / 26

AI Agent - 信息收集

1 evaluations

Benchmark / mode

Score

Rank/total

BrowseComp

51.40

25 / 35

AI Agent - 工具使用

1 evaluations

Benchmark / mode

Score

Rank/total

Terminal Bench 2.0

OnTools

46.40

25 / 32

OpenClaw智能体能力综合测评

2 evaluations

Benchmark / mode

Score

Rank/total

Pinch Bench

OnTools

84.30

18 / 37

Claw Bench

OnTools

20 / 27

View benchmark analysis Compare with other models

DeepSeek V3.2

Publisher

DeepSeek-AI

View publisher details

DeepSeek V3.2 (正式版)

Model Overview

DeepSeek V3.2 系列共包含三个版本：V3.2-Exp、V3.2 正式版和 V3.2-Speciale。V3.2-Exp 是2025年9月发布的实验版本，用于验证新架构；V3.2 正式版于2026年1月发布，是完成完整后训练流程的通用旗舰版本；V3.2-Speciale 是同期发布的高算力推理变体，专为极限推理场景设计。

DeepSeek V3.2 正式版于2026年1月10日发布，总参数 671B，每个 token 激活 37B 参数，上下文窗口 128K，知识截止日期为2025年5月，MIT 协议开源。架构上沿用了 V3.2-Exp 已验证的 DSA 稀疏注意力机制，但后训练部分做了大幅扩展，是 V3.2 系列中面向通用场景的完整产品版本。

后训练阶段，DeepSeek 在 V3.2 上做了两项较大的投入。一是大幅扩展强化学习的计算预算，后训练阶段消耗的算力超过预训练总量的 10%，通过规模化 RL 来提升模型的推理与规划能力，而不只是依赖预训练数据的堆砌。二是构建了大规模 agent 任务合成流水线，覆盖超过 1,800 个不同环境和 85,000 个 agent 任务，涵盖搜索、编程、工具调用等场景，为 RL 训练提供数据支撑。这批合成任务的难度足够高——即使是 frontier 闭源模型在这些任务上的准确率也不超过 62%，确保了 RL 训练的有效性。

V3.2 也是 DeepSeek 首个将思考链整合进工具调用流程的模型，同时支持思考模式和非思考模式下的工具调用。这使得模型在执行多步 agent 任务时，可以在调用外部工具之前先生成推理过程，而不是直接执行。

性能方面，V3.2 在多个推理 benchmark 上达到了与 GPT-5 和 Kimi K2-thinking 相近的水平，在 agent 场景下的表现也显著缩小了与闭源前沿模型的差距。

DataLearner 官方微信

欢迎关注 DataLearner 官方微信，获得最新 AI 技术推送