GPT-5.2 Pro Benchmark Details
GPT-5.2 Pro currently shows benchmark results led by GPQA Diamond (8 / 179, score 93.20), FrontierMath - Tier 4 (9 / 80, score 31.30), HLE (23 / 159, score 50). This page also compares it with 2 competitor models and 2 predecessor or same-series models, including performance and pricing views when available. 1 source link is attached for reference.
Benchmark Results
Benchmark Results
General Knowledge
5 evaluationsMath and Reasoning
2 evaluationsAI Agent - Information Search
2 evaluationsCompetitor Comparison
Benchmark scores for GPT-5.2 Pro compared against top models in its class
3 benchmarks with comparable scores. Each model shows its best score; mode label is displayed below.
| Benchmark | GPT-5.2 ProCurrent | Opus 4.5 |
|---|---|---|
HLE 综合评估 | 50.00Thinking Enabled | Tools | 43.20Extended Thinking | Tools |
Simple Bench 常识推理 | 57.40Thinking Level · Extra High | 62.00Extended Thinking |
31.30Thinking Enabled | 4.20Standard Mode |
Standard API Pricing: GPT-5.2 Pro vs. Peer Models
Shows standard text input and output pricing side by side for each model. If extended-context pricing exists, the chart keeps the base rate and explains the threshold below.
Source: DataLearnerAI. Standard text prices shown here use the default supplier. · USD / 1M tokens
| Model | Supplier | Standard input | Standard output | Base price applies to |
|---|---|---|---|---|
Opus 4.5 | Facebook AI研究实验室 | $5 / 1M tokens | $25 / 1M tokens | — |
Version History
How each version of the GPT-5.2 Pro series stacks up on benchmark tests
6 benchmarks with comparable scores. Each model shows its best score; mode label is displayed below.· Click a row to view its trend chart.
| Benchmark | GPT-5.2 ProCurrent | GPT-5-Pro |
|---|---|---|
ARC-AGI 综合评估 | 90.50Thinking Enabled | 70.20Thinking Enabled |
ARC-AGI-2 综合评估 | 54.20Thinking Enabled | 18.00Thinking Enabled |
GPQA Diamond 综合评估 | 93.20Thinking Enabled | 89.40Thinking Enabled | Tools |
HLE 综合评估 | 50.00Thinking Enabled | Tools | 42.00Thinking Enabled | Tools |
Simple Bench 常识推理 | 57.40Thinking Level · Extra High | 61.60Thinking Enabled |
31.30Thinking Enabled | 14.60Thinking Enabled |
Single-Benchmark Version Trend
Viewing: ARC-AGI · 综合评估
Standard API Pricing Across the GPT-5.2 Pro Series
Shows standard text input and output pricing side by side for each model. If extended-context pricing exists, the chart keeps the base rate and explains the threshold below.
Source: DataLearnerAI. Standard text prices shown here use the default supplier.