DataLearner logoDataLearnerAI
Latest AI Insights
Model Leaderboards
Benchmarks
Model Directory
Model Comparison
Resource Center
Tools
LanguageEnglish
DataLearner logoDataLearner AI

A knowledge platform focused on LLM benchmarking, datasets, and practical instruction with continuously updated capability maps.

Products

  • Leaderboards
  • Model comparison
  • Datasets

Resources

  • Tutorials
  • Editorial
  • Tool directory

Company

  • About
  • Privacy policy
  • Data methodology
  • Contact

© 2026 DataLearner AI. DataLearner curates industry data and case studies so researchers, enterprises, and developers can rely on trustworthy intelligence.

Privacy policyTerms of service
HomeAI ModelsClaude Sonnet 4.5 vs Claude 3.5 Sonnet

Claude Sonnet 4.5vsClaude 3.5 Sonnet

Across 4 shared benchmarks, Claude Sonnet 4.5 leads overall: Claude Sonnet 4.5 wins 4, Claude 3.5 Sonnet wins 0, with 0 ties and an average score difference of +7.74.

Anthropic
Claude Sonnet 4.5

Anthropic · 2025-09-30 · AI model

Anthropic
Claude 3.5 Sonnet

Anthropic · 2024-06-21 · Multimodal model

Claude Sonnet 4.54 wins(100%)(0%)0 winsClaude 3.5 Sonnet

Benchmark scores

Grouped by capability, sorted by largest gap within each. 4 shared benchmarks.

General Knowledge

Claude Sonnet 4.5 2/2
BenchmarkClaude Sonnet 4.5Claude 3.5 SonnetDiff
GPQA Diamond73.7094 / 17559.40138 / 175+14.30
MMLU Pro885 / 124thinking77.6472 / 124+10.36

Math and Reasoning

Claude Sonnet 4.5 2/2
BenchmarkClaude Sonnet 4.5Claude 3.5 SonnetDiff
FrontierMath5.2038 / 60152 / 60+4.20
FrontierMath - Tier 42.1056 / 80Normal (No Tools)072 / 80Normal (No Tools)+2.10

Specs

FieldClaude Sonnet 4.5Claude 3.5 Sonnet
PublisherAnthropicAnthropic
Release date2025-09-302024-06-21
Model typeAI modelMultimodal model
ArchitectureDenseDense
Parameters0.0Not available
Context length1000K200K
Max output65536Not available

API pricing

Prices use DataLearner records when available; missing fields are not inferred.

ItemClaude Sonnet 4.5Claude 3.5 Sonnet
Text input3 美元/100 万tokensNot public
Text output15 美元/100 万tokensNot public
Cache read3.75 美元/100 万tokensNot public
Cache write0.3 美元/100 万tokensNot public

One or both models have incomplete public pricing.

Summary

  • Claude Sonnet 4.5leads in:General Knowledge (2/2), Math and Reasoning (2/2)

On average across the 4 shared benchmarks, Claude Sonnet 4.5 scores 7.74 higher.

Largest single-benchmark gap: GPQA Diamond — Claude Sonnet 4.5 73.70 vs Claude 3.5 Sonnet 59.40 (+14.30).

Page generated from structured model, pricing and benchmark records. No real-time LLM is used to write the prose.

Claude Sonnet 4.5 detailsClaude 3.5 Sonnet details·Customize in compare tool