DataLearner logoDataLearnerAI
Latest AI Insights
Model Evaluations
Benchmarks
Model Directory
Model Comparison
Resource Center
Tools
LanguageEnglish
DataLearner logoDataLearner AI

A knowledge platform focused on LLM benchmarking, datasets, and practical instruction with continuously updated capability maps.

Products

  • Leaderboards
  • Model comparison
  • Datasets

Resources

  • Tutorials
  • Editorial
  • Tool directory

Company

  • About
  • Privacy policy
  • Data methodology
  • Contact

© 2026 DataLearner AI. DataLearner curates industry data and case studies so researchers, enterprises, and developers can rely on trustworthy intelligence.

Privacy policyTerms of service
Page navigation
目录
Model catalogClaude Sonnet 3.7-64K Extended ThinkingBenchmark analysis

Claude Sonnet 3.7-64K Extended Thinking Benchmark Details

Claude Sonnet 3.7-64K Extended Thinking currently shows benchmark results led by GPQA Diamond (39 / 166, score 84.80), MATH-500 (19 / 43, score 96.20), AIME 2024 (28 / 62, score 80).

Benchmark Results

Claude Sonnet 3.7-64K Extended Thinking

Benchmark Results

Thinking
All modesNormal

综合评估

1 evaluations
Benchmark / mode
Score
Rank/total
GPQA Diamond
normal
84.80
39 / 166

数学推理

2 evaluations
Benchmark / mode
Score
Rank/total
MATH-500
normal
96.20
19 / 43
AIME 2024
normal
80
28 / 62
Compare with other models