DeepSeek-V3 Benchmark Details
DeepSeek-V3's leading benchmark results are BBH (rank 3 of 20, score 92.30), MATH (rank 7 of 42, score 87.80), and HumanEval (rank 9 of 39, score 89).
Benchmark Results
Comprehensive Evaluation (5 evaluations)
Benchmark / mode | Score | Rank/total
Mathematical Reasoning (4 evaluations)
Benchmark / mode | Score | Rank/total