Claude Sonnet 4 Benchmark Details
Claude Sonnet 4 currently shows benchmark results led by SWE-bench Verified (13 / 108, score 80.20), Terminal-Bench (10 / 35, score 41.30), MMLU Pro (37 / 126, score 84). 1 source link is attached for reference.
Benchmark Results
Claude Sonnet 4
Benchmark Results
General Knowledge
12 evaluationsBenchmark / mode
Score
Rank/total
Coding and Software Engineer
6 evaluationsBenchmark / mode
Score
Rank/total
Math and Reasoning
12 evaluationsBenchmark / mode
Score
Rank/total
Writing and Creative Capabilities
1 evaluationsBenchmark / mode
Score
Rank/total
AI Agent - Tool Usage
4 evaluationsBenchmark / mode
Score
Rank/total
Agent Level Benchmark
4 evaluationsBenchmark / mode
Score
Rank/total
Claw-style Agent Evaluation
2 evaluationsBenchmark / mode
Score
Rank/total