GPT-5 vs GPT-4o(2025-03-27) 评测对比

See key specs and per-benchmark scores for each model/mode. Scroll horizontally for all columns. 当前对比 2 个模型的评测数据与核心参数。

GPT-5

OpenAI

Release: 2025-08-07
Context length: 400K
Parameters: Not provided
最大输出: 131,072 tokens
支持模态: 常规模式（Non-Thinking Mode） · 思考模式（Thinking Mode） · 深度思考（Deeper Thinking Mode）

Model profile·Playground

Best overall

GPT-5 · 84.20

Best single

GPT-5 · AIME2025 99.60

Modality coverage

GPT-5 · 2 modalities

Head to head

GPT-5

GPT-4o(2025-03-27)

AheadTiedBehind

Benchmarks

Wins

Losses

+50.07

Average diff

Performance benchmarks

Compare benchmark results across thinking modes and tool usage.

Data sourced primarily from official releases (GitHub, Hugging Face, papers), then benchmark leaderboards, then third-party evaluators. Learn about our data methodology

Filter: Best Available·2 modes · 3 Benchmark

图表加载中...

Benchmark score table

Complete scores for each model/mode across selected benchmarks.

3 benchmarks with comparable scores. Each model shows its best score; mode label is displayed below.

Benchmark	GPT-5	GPT-4o(2025-03-27)
ARC-AGI 综合评估	65.70Thinking Level · High	8.80Standard Mode
GPQA Diamond 综合评估	87.30Thinking Enabled ｜ Tools	66.90Standard Mode
AIME2025 数学推理	99.60Thinking Enabled ｜ Tools	26.70Standard Mode

API price comparison

Side-by-side input/output token pricing

Detailed feature breakdown

Licensing, MoE architecture, and multi-modality support.

Features & specs	GPT-5OpenAI	GPT-4o(2025-03-27)OpenAI
Core specsRelease	2025-08-07	2025-03-27
Context length	400K	128K
Max output	131072	4096
MoE	No	No
Supported modes	常规模式（Non-Thinking Mode）思考模式（Thinking Mode）深度思考（Deeper Thinking Mode）	常规模式（Non-Thinking Mode）
LicenseCode Open Source	Not provided	Not provided
Weights Open Source	Not provided	Not provided
Commercial use	不开源	不开源
Modality supportText Input/Output	/	/
Image Input/Output	/	/
ResourcesPaper / report	Introducing GPT-5	ChatGPT — Release Notes: March 27, 2025
DataLearner blog	OpenAI发布GPT-5：这是一个包含实时路由的AI系统，而不仅仅是一个模型	Not provided