DeepSeek V3.2vsDeepSeek-V3.1

在 5 个共同 benchmark 中，DeepSeek V3.2 整体领先：DeepSeek V3.2 领先 5 项，DeepSeek-V3.1 领先 0 项，持平 0 项，平均分差 +6.36。

DeepSeek-AI · 2025-12-01 · 推理大模型

DeepSeek-AI · 2025-08-20 · 聊天大模型

DeepSeek V3.25 项(100%)(0%)0 项DeepSeek-V3.1

评测分数

按能力类目分组，每组内按分差大小排列；共 5 项。

DeepSeek V3.2 领先 2/2

评测项	DeepSeek V3.2	DeepSeek-V3.1	分差
LiveCodeBench	83.3021 / 123Thinking (No Tools)	74.8041 / 123	+8.50
SWE-bench Verified	73.1049 / 112	6674 / 112	+7.10

DeepSeek V3.2 领先 2/2

评测项	DeepSeek V3.2	DeepSeek-V3.1	分差
HLE	25.10102 / 172Thinking (No Tools)	15.90133 / 172	+9.20
GPQA Diamond	82.4069 / 187Thinking (No Tools)	80.1081 / 187	+2.30

DeepSeek V3.2 领先 1/1

评测项	DeepSeek V3.2	DeepSeek-V3.1	分差
AIME2025	93.1030 / 107Thinking (No Tools)	88.4043 / 107	+4.70

价格优先使用 DataLearner 配置的 API 记录；缺失项不做推测。

DeepSeek V3.2在以下类目领先:Coding and Software Engineer (2/2)、General Knowledge (2/2)、Math and Reasoning (1/1)

5 个共同 benchmark 上，DeepSeek V3.2 平均高出 6.36 分。

单项差距最大的 benchmark：HLE — DeepSeek V3.2 25.10，DeepSeek-V3.1 15.90（分差 +9.20）。

本页正文由结构化模型、价格与 benchmark 数据生成，不使用实时 LLM 撰写。