DataLearner logoDataLearnerAI
Latest AI Insights
Model Leaderboards
Benchmarks
Model Directory
Model Comparison
Resource Center
Tools
LanguageEnglish
DataLearner logoDataLearner AI

A knowledge platform focused on LLM benchmarking, datasets, and practical instruction with continuously updated capability maps.

Products

  • Leaderboards
  • Model comparison
  • Datasets

Resources

  • Tutorials
  • Editorial
  • Tool directory

Company

  • About
  • Privacy policy
  • Data methodology
  • Contact

© 2026 DataLearner AI. DataLearner curates industry data and case studies so researchers, enterprises, and developers can rely on trustworthy intelligence.

Privacy policyTerms of service
Page navigation
目录
Model catalogClaude 3.5 SonnetBenchmark analysis

Claude 3.5 Sonnet Benchmark Details

Claude 3.5 Sonnet currently shows benchmark results led by HumanEval (5 / 39, score 92), MMLU (18 / 65, score 88.30), MATH (18 / 42, score 71.10).

Benchmark Results

Claude 3.5 Sonnet

Benchmark Results

Thinking
All modesNormal

综合评估

3 evaluations
Benchmark / mode
Score
Rank/total
MMLU
Standard Mode
88.30
18 / 65
MMLU Pro
Standard Mode
77.64
66 / 118
GPQA Diamond
Standard Mode
59.40
132 / 169

编程与软件工程

1 evaluations
Benchmark / mode
Score
Rank/total
HumanEval
Standard Mode
92
5 / 39

数学推理

3 evaluations
Benchmark / mode
Score
Rank/total
MATH
Standard Mode
71.10
18 / 42
FrontierMath
Standard Mode
1
48 / 56
FrontierMath - Tier 4
Standard Mode
0.01
30 / 37
Compare with other models