DeepSeek V3.2 Benchmark Details
DeepSeek V3.2 currently shows benchmark results led by LiveCodeBench (19 / 118, score 83.30), AIME2025 (30 / 106, score 93.10), GPQA Diamond (61 / 175, score 82.40). This page also tracks comparisons against 3 predecessor or same-series models. 1 source link is attached for reference.
Benchmark Results
Benchmark Results
综合评估
4 evaluations编程与软件工程
5 evaluationsOpenClaw智能体能力综合测评
2 evaluationsVersion History
How each version of the DeepSeek V3.2 series stacks up on benchmark tests
Benchmark Score Comparison
8 benchmarks with comparable scores. Each model shows its best score; mode label is displayed below.· Click a row to view its trend chart.
| Benchmark | DeepSeek V3.2Current | DeepSeek-V3.1 | DeepSeek-V3-0324 | DeepSeek-V3 |
|---|---|---|---|---|
ARC-AGI 综合评估 | 57.00Thinking Enabled | -- | 9.00Standard Mode | -- |
GPQA Diamond 综合评估 | 82.40Thinking Enabled | 80.10Thinking Enabled | 68.40Standard Mode | 59.10Standard Mode |
HLE 综合评估 | 25.10Thinking Enabled | 15.90Thinking Enabled | 5.20Standard Mode | -- |
LiveCodeBench 编程与软件工程 | 83.30Thinking Enabled | 74.80Thinking Enabled | 49.20Standard Mode | 34.60Standard Mode |
SWE-bench Verified 编程与软件工程 | 73.10Thinking Enabled | Tools | 66.00Standard Mode | 38.80Standard Mode | -- |
AIME2025 数学推理 | 93.10Thinking Enabled | 88.40Thinking Enabled | 47.70Standard Mode | -- |
Aider-Polyglot Agent能力评测 | 69.90Thinking Enabled | Tools | 76.30Thinking Enabled | 55.10Standard Mode | -- |
τ²-Bench Agent能力评测 | 80.30Thinking Enabled | Tools | -- | 38.80Standard Mode | Tools | -- |
Single-Benchmark Version Trend
Viewing: ARC-AGI · 综合评估
Standard API Pricing Across the DeepSeek V3.2 Series
Shows standard text input and output pricing side by side for each model. If extended-context pricing exists, the chart keeps the base rate and explains the threshold below.
Source: DataLearnerAI. Standard text prices shown here use the default supplier.
These models use different currencies or billing units, so the page falls back to raw price values instead of a shared bar chart.
| Model | Supplier | Standard input | Standard output | Base price applies to |
|---|---|---|---|---|
DeepSeek-V3.1 | — | 0.56 美元/100 万tokens | 1.68 美元/100 万tokens | — |
DeepSeek-V3-0324 | — | 0.27 美元/100万 tokens | 1.1 美元/100万 tokens | — |