Claude Opus 4 Benchmark Details
Claude Opus 4 currently shows benchmark results led by MATH-500 (3 / 44, score 98.20), MMLU Pro (25 / 126, score 85), Aider-Polyglot (13 / 59, score 72).
Benchmark Results
Claude Opus 4
Benchmark Results
General Knowledge
5 evaluationsBenchmark / mode
Score
Rank/total
Coding and Software Engineer
2 evaluationsBenchmark / mode
Score
Rank/total
Math and Reasoning
9 evaluationsBenchmark / mode
Score
Rank/total
Writing and Creative Capabilities
1 evaluationsBenchmark / mode
Score
Rank/total
Agent Level Benchmark
3 evaluationsBenchmark / mode
Score
Rank/total