Haiku 4.5vsClaude 3.5 Haiku
Across 3 shared benchmarks, Haiku 4.5 leads overall: Haiku 4.5 wins 3, Claude 3.5 Haiku wins 0, with 0 ties and an average score difference of +11.23.
Haiku 4.5
Anthropic · 2025-10-15 · Multimodal model
Claude 3.5 Haiku
Anthropic · 2024-10-22 · Foundation model
Haiku 4.53 wins(100%)(0%)0 winsClaude 3.5 Haiku
Benchmark scores
Grouped by capability, sorted by largest gap within each. 3 shared benchmarks.
General Knowledge
Haiku 4.5 2/2| Benchmark | Haiku 4.5 | Claude 3.5 Haiku | Diff |
|---|---|---|---|
| GPQA Diamond | 60.50135 / 175Normal (No Tools) | 41.60159 / 175 | +18.90 |
| MMLU Pro | 7676 / 124Normal (No Tools) | 6599 / 124 | +11 |
Math and Reasoning
Haiku 4.5 1/1| Benchmark | Haiku 4.5 | Claude 3.5 Haiku | Diff |
|---|
Specs
| Field | Haiku 4.5 | Claude 3.5 Haiku |
|---|---|---|
| Publisher | Anthropic | Anthropic |
| Release date | 2025-10-15 | 2024-10-22 |
| Model type | Multimodal model | Foundation model |
| Architecture | Dense | Dense |
| Parameters | 0.0 | 0.0 |
| Context length | 200K | 200K |
| Max output | 65536 | Not available |
API pricing
Prices use DataLearner records when available; missing fields are not inferred.
| Item | Haiku 4.5 | Claude 3.5 Haiku |
|---|---|---|
| Text input | 1 美元 / 100万 tokens | Not public |
| Text output | 5 美元 / 100万 tokens | Not public |
| Cache read | 1.25 美元 / 100万 tokens | Not public |
| Cache write | 0.10 美元 / 100万 tokens | Not public |
One or both models have incomplete public pricing.
Summary
- Haiku 4.5leads in:General Knowledge (2/2), Math and Reasoning (1/1)
On average across the 3 shared benchmarks, Haiku 4.5 scores 11.23 higher.
Largest single-benchmark gap: GPQA Diamond — Haiku 4.5 60.50 vs Claude 3.5 Haiku 41.60 (+18.90).
Page generated from structured model, pricing and benchmark records. No real-time LLM is used to write the prose.