Claude Sonnet 4.5 vs Claude 3.5 Sonnet New
Across 6 shared benchmarks, Claude Sonnet 4.5 leads overall: Claude Sonnet 4.5 wins 6, Claude 3.5 Sonnet New wins 0, with 0 ties and an average score difference of +12.87.
Claude Sonnet 4.5
Anthropic · 2025-09-30 · AI model
Claude 3.5 Sonnet New
Anthropic · 2024-10-22 · AI model
Claude Sonnet 4.5: 6 wins (100%) · Claude 3.5 Sonnet New: 0 wins (0%)
Benchmark scores
Grouped by capability, sorted by largest gap within each. 6 shared benchmarks.
Coding and Software Engineer
Claude Sonnet 4.5 2/2

| Benchmark | Claude Sonnet 4.5 | Claude 3.5 Sonnet New | Diff |
|---|---|---|---|
| SWE-bench Verified | 82 (rank 3 / 103; parallel_thinking + tool use) | 49 (rank 88 / 103) | +33 |
| LiveCodeBench | 59 (rank 69 / 118) | 38.70 (rank 100 / 118) | +20.30 |
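
The Diff column and the win tally follow from simple arithmetic on the two scores in each row. As a minimal sketch, assuming Diff is the Claude Sonnet 4.5 score minus the Claude 3.5 Sonnet New score and that a positive Diff counts as a win, the two rows listed above can be reproduced like this. The names `ROWS` and `compare` are hypothetical, and only the two benchmarks shown in this group are included, so the mean printed here is not the 6-benchmark average quoted on this page.

```python
# Scores copied from the two rows in the Coding group; the remaining shared
# benchmarks are not listed in this group, so they are deliberately left out.
ROWS = [
    # (benchmark, Claude Sonnet 4.5, Claude 3.5 Sonnet New)
    ("SWE-bench Verified", 82.0, 49.0),
    ("LiveCodeBench", 59.0, 38.70),
]


def compare(rows):
    """Return per-benchmark diffs, (wins_a, wins_b, ties), and the mean diff."""
    # Assumption: Diff = first model's score minus second model's score.
    diffs = {name: round(a - b, 2) for name, a, b in rows}
    wins_a = sum(1 for d in diffs.values() if d > 0)
    wins_b = sum(1 for d in diffs.values() if d < 0)
    ties = sum(1 for d in diffs.values() if d == 0)
    mean_diff = round(sum(diffs.values()) / len(diffs), 2)
    return diffs, (wins_a, wins_b, ties), mean_diff


diffs, tally, mean_diff = compare(ROWS)
print(diffs)      # per-benchmark Diff values
print(tally)      # (wins for Sonnet 4.5, wins for 3.5 Sonnet New, ties)
print(mean_diff)  # mean over these two rows only
```

Running the same function over all shared benchmarks would be one way to check the overall win count and average difference, provided the full score list is available.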
General Knowledge
Claude Sonnet 4.5 2/2

| Benchmark | Claude Sonnet 4.5 | Claude 3.5 Sonnet New | Diff |
|---|---|---|---|
Specs
| Field | Claude Sonnet 4.5 | Claude 3.5 Sonnet New |
|---|---|---|
| Publisher | Anthropic | Anthropic |
| Release date | 2025-09-30 | 2024-10-22 |
| Model type | AI model | AI model |
| Architecture | Dense | Dense |
| Parameters | Not available | Not available |
| Context length | 1000K | 200K |
| Max output | 65536 | Not available |
API pricing
Prices use DataLearner records when available; missing fields are not inferred.
| Item | Claude Sonnet 4.5 | Claude 3.5 Sonnet New |
|---|---|---|
| Text input | $3 per 1M tokens | Not public |
| Text output | $15 per 1M tokens | Not public |
| Cache read | $0.30 per 1M tokens | Not public |
| Cache write | $3.75 per 1M tokens | Not public |
One or both models have incomplete public pricing.
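
To make the per-million-token rates concrete, here is a minimal sketch that estimates the cost of a single request from the Claude Sonnet 4.5 text input and text output prices in the table above. The function name `request_cost` and the token counts in the example are hypothetical. Cache pricing is ignored, and Claude 3.5 Sonnet New is omitted because its prices are listed as not public.

```python
from decimal import Decimal

# Prices in USD per 1M tokens, taken from the Claude Sonnet 4.5 column above.
INPUT_PRICE_PER_M = Decimal("3")
OUTPUT_PRICE_PER_M = Decimal("15")
TOKENS_PER_M = Decimal(1_000_000)


def request_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request, ignoring cache reads and writes."""
    input_cost = Decimal(input_tokens) * INPUT_PRICE_PER_M / TOKENS_PER_M
    output_cost = Decimal(output_tokens) * OUTPUT_PRICE_PER_M / TOKENS_PER_M
    return input_cost + output_cost


# Hypothetical request: 200,000 input tokens and 10,000 output tokens.
cost = request_cost(200_000, 10_000)
print(f"${cost:.2f}")  # 0.60 for input plus 0.15 for output
```

`Decimal` is used instead of floats so that small per-token amounts add up without binary rounding noise.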
Summary
- Claude Sonnet 4.5 leads in: Coding and Software Engineer (2/2), General Knowledge (2/2), Math and Reasoning (2/2)
On average across the 6 shared benchmarks, Claude Sonnet 4.5 scores 12.87 higher.
Largest single-benchmark gap: SWE-bench Verified — Claude Sonnet 4.5 82 vs Claude 3.5 Sonnet New 49 (+33).
Page generated from structured model, pricing and benchmark records. No real-time LLM is used to write the prose.