See key specs and per-benchmark scores for each model/mode. This page currently compares benchmark data and core specs for 2 models.

GPT-5
OpenAI
Best overall: GPT-5 · 84.20
Best single: GPT-5 · AIME2025 99.60
Modality coverage: GPT-5 · 2 modalities
Head to head (GPT-5 vs. GPT-4o(2025-03-27))
Benchmarks: 3
Wins: 3
Losses: 0
Average diff: +50.07
Compare benchmark results across thinking modes and tool usage.
Data sourced primarily from official releases (GitHub, Hugging Face, papers), then benchmark leaderboards, then third-party evaluators.
Complete scores for each model/mode across selected benchmarks.
3 benchmarks with comparable scores. Each model shows its best score; mode label is displayed below.
| Benchmark | Category | GPT-5 | GPT-4o(2025-03-27) |
|---|---|---|---|
| ARC-AGI | Comprehensive evaluation | 65.70 (Thinking Level · High) | 8.80 (Standard Mode) |
| GPQA Diamond | Comprehensive evaluation | 87.30 (Thinking Enabled, Tools) | 66.90 (Standard Mode) |
| AIME2025 | Math reasoning | 99.60 (Thinking Enabled, Tools) | 26.70 (Standard Mode) |
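
The summary figures shown with the head-to-head cards (wins, losses, average diff, best overall) can be reproduced from the three rows above. The following is a minimal sketch, assuming the page uses a plain unweighted mean over the comparable benchmarks; the page does not state its formula, and the function name `summarize` and the dictionary keys are illustrative, not part of any published API.

```python
# Scores copied from the benchmark table, as (GPT-5, GPT-4o(2025-03-27)).
scores = {
    "ARC-AGI": (65.70, 8.80),
    "GPQA Diamond": (87.30, 66.90),
    "AIME2025": (99.60, 26.70),
}


def summarize(scores):
    """Head-to-head summary from the first model's perspective.

    Assumption: all averages are simple unweighted means.
    """
    n = len(scores)
    diffs = [first - second for first, second in scores.values()]
    best_name = max(scores, key=lambda name: scores[name][0])
    return {
        "benchmarks": n,
        "wins": sum(d > 0 for d in diffs),
        "losses": sum(d < 0 for d in diffs),
        "average_diff": round(sum(diffs) / n, 2),
        "best_overall": round(sum(first for first, _ in scores.values()) / n, 2),
        "best_single": (best_name, scores[best_name][0]),
    }


summary = summarize(scores)
print(summary)
```

Under these assumptions the result agrees with the card values: 3 benchmarks, 3 wins, 0 losses, an average diff of +50.07, a best overall of 84.20, and AIME2025 (99.60) as the best single score.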
Side-by-side input/output token pricing
Licensing, MoE architecture, and multi-modality support.
| Group | Features & specs | GPT-5 (OpenAI) | GPT-4o(2025-03-27) (OpenAI) |
|---|---|---|---|
| Core specs | Release | 2025-08-07 | 2025-03-27 |
| | Context length | 400K | 128K |
| | Max output | 131072 | 4096 |
| | MoE | No | No |
| | Supported modes | Non-Thinking Mode, Thinking Mode, Deeper Thinking Mode | Non-Thinking Mode |
| License | Code Open Source | Not provided | Not provided |
| | Weights Open Source | Not provided | Not provided |
| | Commercial use | Not open source | Not open source |
| Modality support | Text Input/Output | / | / |
| | Image Input/Output | / | / |
| Resources | Paper / report | Introducing GPT-5 | ChatGPT — Release Notes: March 27, 2025 |
| | DataLearner blog | OpenAI releases GPT-5: an AI system with real-time routing, not just a single model | Not provided |
