Claude 3.5 Sonnet Benchmark Details
Claude 3.5 Sonnet currently shows benchmark results led by HumanEval (5 / 39, score 92), MMLU (18 / 65, score 88.30), MATH (18 / 42, score 71.10).
Benchmark Results
Claude 3.5 Sonnet
Benchmark Results
综合评估
3 evaluationsBenchmark / mode
Score
Rank/total
数学推理
3 evaluationsBenchmark / mode
Score
Rank/total