See key specs and per-benchmark scores for each model/mode. This page compares the evaluation data and core parameters of 2 models.

DeepSeek-V4-Pro
DeepSeek-AI
Best overall: DeepSeek-V4-Pro · 79.98
Best single: DeepSeek-V4-Pro · LiveCodeBench 93.50
Modality coverage: DeepSeek-V4-Pro · 2 modalities
Head to head: 5 benchmarks, 5 wins, 0 losses, +15.62 average diff
Compare benchmark results across thinking modes and tool usage.
Data is sourced primarily from official releases (GitHub, Hugging Face, papers), then from benchmark leaderboards, then from third-party evaluators.
Complete scores for each model/mode across selected benchmarks.
5 benchmarks have comparable scores. Each model shows its best score; the mode used for that score follows in parentheses.
| Benchmark | DeepSeek-V4-Pro | DeepSeek-V3.1 |
|---|---|---|
| GPQA Diamond (general evaluation) | 90.10 (Thinking Level · High) | 80.10 (Thinking Enabled) |
| HLE (general evaluation) | 48.20 (Thinking Level · Extra High, Tools) | 15.90 (Thinking Enabled) |
| MMLU Pro (general evaluation) | 87.50 (Thinking Level · High) | 85.00 (Thinking Enabled) |
| LiveCodeBench (coding and software engineering) | 93.50 (Thinking Level · High) | 74.80 (Thinking Enabled) |
| SWE-bench Verified (coding and software engineering) | 80.60 (Thinking Level · Extra High, Tools) | 66.00 (Standard Mode) |
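The summary figures shown for this comparison (5 benchmarks, 5 wins, 0 losses, +15.62 average diff, 79.98 best overall) are consistent with plain unweighted arithmetic over the table. The sketch below reproduces them under that assumption. The function name `head_to_head` and the dictionary layout are hypothetical, chosen for illustration; the site's actual computation is not documented here.

```python
# Best scores per benchmark, transcribed from the table:
# (DeepSeek-V4-Pro, DeepSeek-V3.1).
SCORES = {
    "GPQA Diamond": (90.10, 80.10),
    "HLE": (48.20, 15.90),
    "MMLU Pro": (87.50, 85.00),
    "LiveCodeBench": (93.50, 74.80),
    "SWE-bench Verified": (80.60, 66.00),
}


def head_to_head(scores):
    """Summarize model A against model B over shared benchmarks.

    Assumption: every figure is an unweighted mean or a simple count.
    """
    diffs = {name: a - b for name, (a, b) in scores.items()}
    n = len(diffs)
    return {
        "benchmarks": n,
        "wins": sum(1 for d in diffs.values() if d > 0),
        "losses": sum(1 for d in diffs.values() if d < 0),
        "average_diff": round(sum(diffs.values()) / n, 2),
        "average_a": round(sum(a for a, _ in scores.values()) / n, 2),
        "best_single": max(scores, key=lambda name: scores[name][0]),
    }


summary = head_to_head(SCORES)
print(summary)
```

Note that the scores being compared come from different modes (for example, tools enabled on one side and not on the other), so the average diff is a best-versus-best figure rather than a like-for-like one.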
Licensing, MoE architecture, and multi-modality support.
| Features & specs | DeepSeek-V4-Pro (DeepSeek-AI) | DeepSeek-V3.1 (DeepSeek-AI) |
|---|---|---|
| Core specs | | |
| Release | 2026-04-24 | 2025-08-20 |
| Context length (tokens) | 1M | 128K |
| Parameters (×100M) | 16000 | 6710 |
| Active parameters (×100M) | 490 | 370 |
| Max output (tokens) | 384000 | 8192 |
| MoE | Yes | Yes |
| Supported modes | No mode data | Non-Thinking Mode, Thinking Mode |
| License | | |
| Code Open Source | Closed Source | Closed Source |
| Weights Open Source | Closed Source | Closed Source |
| Commercial use | Free commercial-use license | Free commercial-use license |
| Modality support | | |
| Text Input/Output | / | / |
| Image Input/Output | / | Not provided |
| Resources | | |
| Paper / report | DeepSeek-V4 Technical Report | DeepSeek-V3.1 Release |
| DataLearner blog | Not provided | DeepSeek V4 has not arrived yet, but DeepSeek-AI has upgraded DeepSeek V3 to DeepSeek V3.1: a minor update with the core architecture and parameters unchanged |
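Since both models are MoE, the ratio of active to total parameters is a useful derived figure. The sketch below computes it from the two parameter rows. It assumes the listed counts are in units of 100 million parameters (so 6710 reads as 671B), which the page does not state explicitly; `UNIT`, `SPECS`, and `active_fraction` are hypothetical names used only for this illustration.

```python
# Parameter counts as listed in the specs table.
# Assumption: one unit equals 100 million parameters.
UNIT = 100_000_000

SPECS = {
    "DeepSeek-V4-Pro": {"total": 16000, "active": 490},
    "DeepSeek-V3.1": {"total": 6710, "active": 370},
}


def active_fraction(spec):
    """Share of the model's parameters that are active, per the table."""
    return spec["active"] / spec["total"]


for name, spec in SPECS.items():
    total_b = spec["total"] * UNIT / 1e9    # billions of parameters
    active_b = spec["active"] * UNIT / 1e9  # billions of parameters
    print(
        f"{name}: {total_b:.0f}B total, {active_b:.0f}B active, "
        f"{active_fraction(spec):.1%} active"
    )
```

The ratio itself does not depend on the unit assumption, only the absolute sizes printed alongside it do.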

DeepSeek-V3.1
DeepSeek-AI