Claude 3.5 Sonnet New Benchmark Details
Claude 3.5 Sonnet New currently shows benchmark results led by HumanEval (3 / 39, score 93.70), BBH (2 / 20, score 92.60), MMLU (18 / 65, score 88.30).
Benchmark Results
Claude 3.5 Sonnet New
Benchmark Results
General Knowledge
4 evaluationsBenchmark / mode
Score
Rank/total
Coding and Software Engineer
3 evaluationsBenchmark / mode
Score
Rank/total
Math and Reasoning
5 evaluationsBenchmark / mode
Score
Rank/total
Writing and Creative Capabilities
1 evaluationsBenchmark / mode
Score
Rank/total