Latest AI Insights

Model Leaderboards

Model Directory

Model Comparison

Resource Center

LanguageEnglish

Search blog

DataLearner AI

A knowledge platform focused on LLM benchmarking, datasets, and practical instruction with continuously updated capability maps.

Products

Leaderboards
Model comparison
Datasets

Resources

Tutorials
Editorial
Tool directory

Company

About
Privacy policy
Data methodology
Contact

© 2026 DataLearner AI. DataLearner curates industry data and case studies so researchers, enterprises, and developers can rely on trustworthy intelligence.

Privacy policy Terms of service

Claude 3.5 Sonnet Benchmark Results & Rankings | DataLearnerAI

Page navigation

Page navigation

Model catalogClaude 3.5 SonnetBenchmark analysis

Claude 3.5 Sonnet Benchmark Details

Claude 3.5 Sonnet currently shows benchmark results led by HumanEval (5 / 39, score 92), MMLU (18 / 65, score 88.30), MATH (18 / 42, score 71.10).

Benchmark Results

Claude 3.5 Sonnet

Benchmark Results

Thinking

General Knowledge

3 evaluations

Benchmark / mode

Score

Rank/total

88.30

18 / 65

77.64

74 / 126

59.40

140 / 177

Coding and Software Engineer

1 evaluations

Benchmark / mode

Score

Rank/total

92

5 / 39

Math and Reasoning

3 evaluations

Benchmark / mode

Score

Rank/total

71.10

18 / 42

1

52 / 60

FrontierMath - Tier 4

Standard Mode

0

72 / 80

Compare with other models