DeepSeek-V4-Pro vs DeepSeek V3.2

Across 8 shared benchmarks, DeepSeek-V4-Pro leads overall with 5 wins to 3 for DeepSeek V3.2, no ties, and an average score difference of +102.87.

DeepSeek-V4-Pro

DeepSeek-AI · 2026-04-24 · Reasoning model

DeepSeek V3.2

DeepSeek-AI · 2025-12-01 · Reasoning model

DeepSeek-V4-Pro: 5 wins (63%) · DeepSeek V3.2: 3 wins (38%)

Benchmark scores

Grouped by capability, sorted by largest gap within each. 8 shared benchmarks.

Coding and Software Engineer

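Category lead: DeepSeek-V4-Pro (3/4)

| Benchmark | V4-Pro score | V4-Pro rank | V4-Pro mode | V3.2 score | V3.2 rank | V3.2 mode | Diff |
|---|---|---|---|---|---|---|---|
| CodeForces | 3,206 | 2 / 16 | Max (No Tools) | 2,386 | 11 / 16 | Thinking (No Tools) | +820 |
| LiveCodeBench | 56.80 | 75 / 120 | Normal (No Tools) | 83.30 | 21 / 120 | Thinking (No Tools) | -26.50 |
| SWE-Bench Pro - Public | 52.10 | 28 / 43 | Normal (With Tools) | 40.90 | 38 / 43 | Thinking (No Tools) | +11.20 |
| SWE-bench Verified | 73.60 | 41 / 108 | Normal (With Tools) | 73.10 | 45 / 108 | | +0.50 |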

General Knowledge

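Category lead: DeepSeek V3.2 (2/2)

| Benchmark | V4-Pro score | V4-Pro rank | V4-Pro mode | V3.2 score | V3.2 rank | V3.2 mode | Diff |
|---|---|---|---|---|---|---|---|
| HLE | 7.70 | 141 / 157 | Normal (No Tools) | 25.10 | 87 / 157 | Thinking (No Tools) | -17.40 |
| GPQA Diamond | 72.90 | 102 / 178 | Normal (No Tools) | 82.40 | 64 / 178 | Thinking (No Tools) | -9.50 |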

AI Agent - Information Search

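Category lead: DeepSeek-V4-Pro (1/1)

| Benchmark | V4-Pro score | V4-Pro rank | V4-Pro mode | V3.2 score | V3.2 rank | V3.2 mode | Diff |
|---|---|---|---|---|---|---|---|
| BrowseComp | 83.40 | 9 / 45 | Extra-High Thinking (With Tools) | 51.40 | 35 / 45 | Thinking (No Tools) | +32 |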

AI Agent - Tool Usage

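Category lead: DeepSeek-V4-Pro (1/1)

| Benchmark | V4-Pro score | V4-Pro rank | V4-Pro mode | V3.2 score | V3.2 rank | V3.2 mode | Diff |
|---|---|---|---|---|---|---|---|
| Terminal Bench 2.0 | 59.10 | 22 / 46 | Normal (With Tools) | 46.40 | 39 / 46 | | +12.70 |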
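The grouping and ordering used in the benchmark tables above ("grouped by capability, sorted by largest gap within each") can be reproduced with a short sketch. The scores are copied from the tables; the function name `group_and_sort` is hypothetical, and sorting by the absolute value of the gap is an assumption that happens to match the order shown on the page.

```python
from collections import defaultdict

# (capability, benchmark, DeepSeek-V4-Pro score, DeepSeek V3.2 score),
# copied from the benchmark tables on this page.
ROWS = [
    ("Coding and Software Engineer", "SWE-bench Verified", 73.60, 73.10),
    ("Coding and Software Engineer", "CodeForces", 3206.0, 2386.0),
    ("Coding and Software Engineer", "SWE-Bench Pro - Public", 52.10, 40.90),
    ("Coding and Software Engineer", "LiveCodeBench", 56.80, 83.30),
    ("General Knowledge", "GPQA Diamond", 72.90, 82.40),
    ("General Knowledge", "HLE", 7.70, 25.10),
    ("AI Agent - Information Search", "BrowseComp", 83.40, 51.40),
    ("AI Agent - Tool Usage", "Terminal Bench 2.0", 59.10, 46.40),
]


def group_and_sort(rows):
    """Group rows by capability, then order each group by |diff|, largest first.

    Sorting on the absolute gap is an assumption; the page only says
    "sorted by largest gap".
    """
    groups = defaultdict(list)
    for capability, benchmark, v4_pro, v3_2 in rows:
        diff = round(v4_pro - v3_2, 2)  # positive means DeepSeek-V4-Pro is ahead
        groups[capability].append((benchmark, diff))
    for entries in groups.values():
        entries.sort(key=lambda entry: abs(entry[1]), reverse=True)
    return dict(groups)


if __name__ == "__main__":
    for capability, entries in group_and_sort(ROWS).items():
        print(capability)
        for benchmark, diff in entries:
            print(f"  {benchmark}: {diff:+.2f}")
```

The input rows are deliberately shuffled so that the sort, not the input order, produces the ordering seen in the tables.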

Specs

| Field | DeepSeek-V4-Pro | DeepSeek V3.2 |
|---|---|---|
| Publisher | DeepSeek-AI | DeepSeek-AI |
| Release date | 2026-04-24 | 2025-12-01 |
| Model type | Reasoning model | Reasoning model |
| Architecture | MoE | MoE |
| Parameters | 1.6T | 671B |
| Context length | 1M | 128K |
| Max output | 375K | 8K |

API pricing

Prices use DataLearner records when available; missing fields are not inferred.

| Item | DeepSeek-V4-Pro | DeepSeek V3.2 |
|---|---|---|
| Text input | $0.435 / 1M tokens | Not public |
| Text output | $0.87 / 1M tokens | Not public |
| Cache read | $0.87 / 1M tokens | Not public |
| Cache write | $0.003625 / 1M tokens | Not public |

DeepSeek V3.2 has no public pricing in these records, so the two models cannot be compared on cost.
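To make the per-1M-token prices concrete, here is a minimal sketch that estimates the cost of a single DeepSeek-V4-Pro workload from the listed text input and text output prices. The function name `estimate_cost_usd` and the token counts are hypothetical, and cache reads and writes are ignored as a simplifying assumption.

```python
# Per-1M-token prices for DeepSeek-V4-Pro, in USD, as listed in the pricing section.
PRICE_PER_M_INPUT = 0.435
PRICE_PER_M_OUTPUT = 0.87


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate cost from token counts, ignoring cache pricing (an assumption)."""
    total = input_tokens * PRICE_PER_M_INPUT + output_tokens * PRICE_PER_M_OUTPUT
    return total / 1_000_000


if __name__ == "__main__":
    # Hypothetical workload: 200,000 input tokens and 50,000 output tokens.
    cost = estimate_cost_usd(200_000, 50_000)
    print(f"${cost:.4f}")  # prints $0.1305
```

No equivalent estimate is possible for DeepSeek V3.2 from this page, since its prices are listed as not public.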

Summary

  • DeepSeek-V4-Pro leads in: Coding and Software Engineer (3/4), AI Agent - Information Search (1/1), AI Agent - Tool Usage (1/1)
  • DeepSeek V3.2 leads in: General Knowledge (2/2)

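On average across the 8 shared benchmarks, DeepSeek-V4-Pro scores 102.87 points higher, a figure driven almost entirely by the +820 CodeForces gap; the other seven differences sum to only +3.00.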

Largest single-benchmark gap: CodeForces, where DeepSeek-V4-Pro scores 3,206 vs 2,386 for DeepSeek V3.2 (+820).
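The headline figures in this summary (win counts, average difference, largest gap) follow directly from the per-benchmark differences. The sketch below recomputes them; the function name `summarize` is hypothetical, and the extra "mean without the largest gap" value is an illustration added here, not a number reported by the page.

```python
# Score differences (DeepSeek-V4-Pro minus DeepSeek V3.2), copied from the Diff column.
DIFFS = {
    "CodeForces": 820.0,
    "LiveCodeBench": -26.50,
    "SWE-Bench Pro - Public": 11.20,
    "SWE-bench Verified": 0.50,
    "HLE": -17.40,
    "GPQA Diamond": -9.50,
    "BrowseComp": 32.0,
    "Terminal Bench 2.0": 12.70,
}


def summarize(diffs):
    """Recompute wins, ties, mean difference and largest gap from a diff table."""
    values = list(diffs.values())
    largest = max(diffs, key=lambda name: abs(diffs[name]))
    rest = [d for name, d in diffs.items() if name != largest]
    return {
        "v4_pro_wins": sum(1 for d in values if d > 0),
        "v3_2_wins": sum(1 for d in values if d < 0),
        "ties": sum(1 for d in values if d == 0),
        "mean_diff": sum(values) / len(values),
        "largest_gap": largest,
        # Illustration only: the mean with the largest gap left out.
        "mean_diff_without_largest": sum(rest) / len(rest),
    }


if __name__ == "__main__":
    for key, value in summarize(DIFFS).items():
        print(f"{key}: {value}")
```

The mean comes out at about 102.875, which the page reports as 102.87. Leaving out CodeForces, whose score appears to be a rating rather than a percentage-style score, the mean of the remaining seven differences is roughly +0.43, so the average is best read alongside the per-benchmark table rather than on its own.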

Page generated from structured model, pricing and benchmark records. No real-time LLM is used to write the prose.