GPT-5.2 ProvsOpus 4.5
Across 5 shared benchmarks, GPT-5.2 Pro leads overall: GPT-5.2 Pro wins 5, Opus 4.5 wins 0, with 0 ties and an average score difference of +13.44.
GPT-5.2 Pro
OpenAI · 2025-12-11 · Reasoning model
Opus 4.5
Anthropic · 2025-11-25 · Reasoning model
GPT-5.2 Pro5 wins(100%)(0%)0 winsOpus 4.5
Benchmark scores
Grouped by capability, sorted by largest gap within each. 5 shared benchmarks.
General Knowledge
GPT-5.2 Pro 4/4| Benchmark | GPT-5.2 Pro | Opus 4.5 | Diff |
|---|---|---|---|
| ARC-AGI-2 | 54.2019 / 58thinking | 37.6025 / 58Extended (no tools) | +16.60 |
| ARC-AGI | 90.5015 / 65thinking | 8021 / 65Extended (no tools) | +10.50 |
| HLE | 5018 / 149thinking + 使用工具 | 43.2034 / 149 |
Specs
| Field | GPT-5.2 Pro | Opus 4.5 |
|---|---|---|
| Publisher | OpenAI | Anthropic |
| Release date | 2025-12-11 | 2025-11-25 |
| Model type | Reasoning model | Reasoning model |
| Architecture | Dense | Dense |
| Parameters | 0.0 | 0.0 |
| Context length | 256K | 200K |
| Max output | Not available | 65536 |
API pricing
Prices use DataLearner records when available; missing fields are not inferred.
| Item | GPT-5.2 Pro | Opus 4.5 |
|---|---|---|
| Text input | $21.00 / 1M tokens | $5 / 1M tokens |
| Text output | $168.00 / 1M tokens | $25 / 1M tokens |
| Cache read | Not public | $0.5 / 1M tokens |
| Cache write | Not public | $6.25 / 1M tokens |
Summary
- GPT-5.2 Proleads in:General Knowledge (4/4), Math and Reasoning (1/1)
On average across the 5 shared benchmarks, GPT-5.2 Pro scores 13.44 higher.
Largest single-benchmark gap: FrontierMath - Tier 4 — GPT-5.2 Pro 31.30 vs Opus 4.5 4.20 (+27.10).
Page generated from structured model, pricing and benchmark records. No real-time LLM is used to write the prose.