DeepSeek-V3.1 Benchmark Details
DeepSeek-V3.1's top-ranked benchmark results are currently MMLU (rank 1 / 65, score 93.40), SimpleQA (rank 4 / 45, score 93.40), and AIME 2024 (rank 7 / 62, score 93.10).
Benchmark Results
General Knowledge
7 evaluations
Benchmark / mode | Score | Rank/total
Coding and Software Engineering
3 evaluations
Benchmark / mode | Score | Rank/total
Math and Reasoning
4 evaluations
Benchmark / mode | Score | Rank/total
Agent Level Benchmark
2 evaluations
Benchmark / mode | Score | Rank/total