DeepSeek V3.2vsDeepSeek-V3.1

Across 5 shared benchmarks, DeepSeek V3.2 leads overall: DeepSeek V3.2 wins 5, DeepSeek-V3.1 wins 0, with 0 ties and an average score difference of +6.36.

DeepSeek V3.2

DeepSeek-AI · 2025-12-01 · Reasoning model

DeepSeek-V3.1

DeepSeek-AI · 2025-08-20 · Chat model

DeepSeek V3.25 wins(100%)(0%)0 winsDeepSeek-V3.1

Benchmark scores

Grouped by capability, sorted by largest gap within each. 5 shared benchmarks.

Coding and Software Engineer

DeepSeek V3.2 2/2

Benchmark	DeepSeek V3.2	DeepSeek-V3.1	Diff
LiveCodeBench	83.3021 / 123Thinking (No Tools)	74.8041 / 123	+8.50
SWE-bench Verified	73.1049 / 112	6674 / 112	+7.10

General Knowledge

DeepSeek V3.2 2/2

Benchmark	DeepSeek V3.2	DeepSeek-V3.1	Diff
HLE	25.10102 / 172Thinking (No Tools)	15.90133 / 172	+9.20
GPQA Diamond	82.4069 / 187Thinking (No Tools)	80.1081 / 187	+2.30

Math and Reasoning

DeepSeek V3.2 1/1

Benchmark	DeepSeek V3.2	DeepSeek-V3.1	Diff
AIME2025	93.1030 / 107Thinking (No Tools)	88.4043 / 107	+4.70

Specs

Field	DeepSeek V3.2	DeepSeek-V3.1
Publisher	DeepSeek-AI	DeepSeek-AI
Release date	2025-12-01	2025-08-20
Model type	Reasoning model	Chat model
Architecture	MoE	MoE
Parameters	671B	671B
Context length	128K	128K
Max output	8K	8K

API pricing

Prices use DataLearner records when available; missing fields are not inferred.

Item	DeepSeek V3.2	DeepSeek-V3.1
Text input	$0.28 / 1M tokens	$0.56 / 1M tokens
Text output	$0.42 / 1M tokens	$1.68 / 1M tokens
Cache read	$0.028 / 1M tokens	$0.28 / 1M tokens
Cache write	$0.28 / 1M tokens	$0.56 / 1M tokens

Summary

DeepSeek V3.2leads in:Coding and Software Engineer (2/2), General Knowledge (2/2), Math and Reasoning (1/1)

On average across the 5 shared benchmarks, DeepSeek V3.2 scores 6.36 higher.

Largest single-benchmark gap: HLE — DeepSeek V3.2 25.10 vs DeepSeek-V3.1 15.90 (+9.20).

Page generated from structured model, pricing and benchmark records. No real-time LLM is used to write the prose.

DeepSeek V3.2 details DeepSeek-V3.1 details·Customize in compare tool