Claude Sonnet 3.7 Benchmark Details
Claude Sonnet 3.7's listed benchmark results are led by SWE-bench Verified (score 70.30, rank 40 of 93), LiveBench (score 68.64, rank 23 of 51), and GPQA Diamond (score 77, rank 74 of 162). One source link is attached for reference.
Benchmark Results
Overall evaluation
3 evaluations
Benchmark / mode
Score
Rank/total