DeepSeek V3.2-Exp Benchmark Details
DeepSeek V3.2-Exp currently shows benchmark results led by SimpleQA (1 / 45, score 97.10), Aider-Polyglot (11 / 59, score 74.20), MMLU Pro (25 / 126, score 85).
Benchmark Results
DeepSeek V3.2-Exp
Benchmark Results
General Knowledge
9 evaluationsBenchmark / mode
Score
Rank/total
Coding and Software Engineer
3 evaluationsBenchmark / mode
Score
Rank/total
Math and Reasoning
2 evaluationsBenchmark / mode
Score
Rank/total
AI Agent - Tool Usage
2 evaluationsBenchmark / mode
Score
Rank/total
Agent Level Benchmark
5 evaluationsBenchmark / mode
Score
Rank/total