Claude Sonnet 3.7-64K Extended Thinking Benchmark Results & Rankings | DataLearnerAI


Claude Sonnet 3.7-64K Extended Thinking Benchmark Details

Claude Sonnet 3.7-64K Extended Thinking currently has three benchmark results listed: GPQA Diamond (score 84.80, ranked 49 of 177), MATH-500 (score 96.20, ranked 19 of 44), and AIME 2024 (score 80, ranked 27 of 62).

Benchmark Results: Claude Sonnet 3.7-64K Extended Thinking (Thinking)

Comprehensive evaluation (1 evaluation)

| Benchmark | Mode | Score | Rank / total |
| GPQA Diamond | Standard Mode | 84.80 | 49 / 177 |

Mathematical reasoning (2 evaluations)

| Benchmark | Mode | Score | Rank / total |
| MATH-500 | Standard Mode | 96.20 | 19 / 44 |
| AIME 2024 | Standard Mode | 80 | 27 / 62 |
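
The rank/total figures are easier to compare across benchmarks when converted to a fraction of the listed models. The sketch below shows that arithmetic. It assumes that "rank / total" means the model's position among all models listed for that benchmark, and that the three rows map to GPQA Diamond, MATH-500, and AIME 2024 as stated in the summary sentence. The helper name `top_fraction` is hypothetical and not part of any DataLearner tooling.

```python
def top_fraction(rank: int, total: int) -> float:
    """Return rank / total: the share of listed models at or above this rank.

    Smaller values mean a better relative position.
    Assumption: rank 1 is the best and total is the number of listed models.
    """
    if not 1 <= rank <= total:
        raise ValueError("rank must be between 1 and total")
    return rank / total


# (rank, total) pairs copied from the results listed on this page.
results = {
    "GPQA Diamond": (49, 177),
    "MATH-500": (19, 44),
    "AIME 2024": (27, 62),
}

for name, (rank, total) in results.items():
    print(f"{name}: rank {rank} of {total}, top {top_fraction(rank, total):.1%}")
```

Under this reading, a high raw score does not by itself imply a high relative position: the MATH-500 score (96.20) is the highest of the three, yet its relative rank is weaker than the GPQA Diamond one.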
