Claude Sonnet 3.7 Benchmark Details
Claude Sonnet 3.7 currently shows benchmark results led by LiveBench (24 / 52, score 68.64), GPQA Diamond (89 / 179, score 77), SWE-bench Verified (55 / 108, score 70.30). 1 source link is attached for reference.
Benchmark Results
Claude Sonnet 3.7
Benchmark Results
General Knowledge
5 evaluationsBenchmark / mode
Score
Rank/total
Coding and Software Engineer
2 evaluationsBenchmark / mode
Score
Rank/total
Math and Reasoning
5 evaluationsBenchmark / mode
Score
Rank/total
Agent Level Benchmark
5 evaluationsBenchmark / mode
Score
Rank/total