GPT-5.5 vs GPT-5.1 评测对比

See key specs and per-benchmark scores for each model/mode. Scroll horizontally for all columns. 当前对比 2 个模型的评测数据与核心参数。

GPT-5.5

OpenAI

Release: 2026-04-23
Context length: 1000K
Parameters: Not provided
最大输出: 131,072 tokens

Model profile·Playground

GPT-5.1

OpenAI

Release: 2025-11-12
Context length: 400K
Parameters: Not provided
最大输出: 131,072 tokens
支持模态: 常规模式（Non-Thinking Mode） · 思考模式（Thinking Mode）

Model profile·Playground

Loading comparison...

GPT-5.5 vs GPT-5.1 评测对比

See key specs and per-benchmark scores for each model/mode. Scroll horizontally for all columns. 当前对比 2 个模型的评测数据与核心参数。

GPT-5.5

OpenAI

Release: 2026-04-23
Context length: 1000K
Parameters: Not provided
最大输出: 131,072 tokens

Model profile·Playground

GPT-5.1

OpenAI

Release: 2025-11-12
Context length: 400K
Parameters: Not provided
最大输出: 131,072 tokens
支持模态: 常规模式（Non-Thinking Mode） · 思考模式（Thinking Mode）

Model profile·Playground

Best overall

GPT-5.5 · 70.08

Best single

GPT-5.5 · ARC-AGI 95.00

Modality coverage

GPT-5.5 · 2 modalities

Head to head

GPT-5.5

GPT-5.1

AheadTiedBehind

Benchmarks

Wins

Losses

+23.34

Average diff

Performance benchmarks

Compare benchmark results across thinking modes and tool usage.

Data sourced primarily from official releases (GitHub, Hugging Face, papers), then benchmark leaderboards, then third-party evaluators. Learn about our data methodology

Filter: Best Available·2 modes · 5 Benchmark

图表加载中...

Benchmark score table

Complete scores for each model/mode across selected benchmarks.

5 benchmarks with comparable scores. Each model shows its best score; mode label is displayed below.

Benchmark	GPT-5.5	GPT-5.1
ARC-AGI 综合评估	95.00Thinking Level · Extra High	72.80Thinking Level · High
ARC-AGI-2 综合评估	85.00Thinking Level · Extra High	17.60Thinking Level · High
GPQA Diamond 综合评估	93.60Thinking Level · High	88.10Thinking Enabled
HLE 综合评估	41.40Thinking Level · High	42.70Thinking Level · High ｜ Tools
FrontierMath - Tier 4 数学推理	35.40Thinking Level · Extra High	12.50Thinking Level · High ｜ Tools

API price comparison

Side-by-side input/output token pricing

Detailed feature breakdown

Licensing, MoE architecture, and multi-modality support.

Features & specs	GPT-5.5OpenAI	GPT-5.1OpenAI
Core specsRelease	2026-04-23	2025-11-12
Context length	1000K	400K
Max output	131072	131072
MoE	No	No
Supported modes	No mode data	常规模式（Non-Thinking Mode）思考模式（Thinking Mode）
LicenseCode Open Source	Not provided	Not provided
Weights Open Source	Not provided	Not provided
Commercial use	不开源	不开源
Modality supportText Input/Output	/	/
Image Input/Output	/	/
ResourcesPaper / report	Introducing GPT‑5.5	GPT-5.1: A smarter, more conversational ChatGPT
DataLearner blog	OpenAI 发布 GPT-5.5：代号	OpenAI发布GPT-5.1：围绕“对话体验、一致性、任务适配性”进行的系统化优化的小幅更新！

Loading comparison...

Best overall

GPT-5.5 · 70.08

Best single

GPT-5.5 · ARC-AGI 95.00

Modality coverage

GPT-5.5 · 2 modalities

Head to head

GPT-5.5

GPT-5.1

AheadTiedBehind

Benchmarks

Wins

Losses

+23.34

Average diff

Performance benchmarks

Compare benchmark results across thinking modes and tool usage.

Data sourced primarily from official releases (GitHub, Hugging Face, papers), then benchmark leaderboards, then third-party evaluators. Learn about our data methodology

Filter: Best Available·2 modes · 5 Benchmark

图表加载中...

Benchmark score table

Complete scores for each model/mode across selected benchmarks.

5 benchmarks with comparable scores. Each model shows its best score; mode label is displayed below.

Benchmark	GPT-5.5	GPT-5.1
ARC-AGI 综合评估	95.00Thinking Level · Extra High	72.80Thinking Level · High
ARC-AGI-2 综合评估	85.00Thinking Level · Extra High	17.60Thinking Level · High
GPQA Diamond 综合评估	93.60Thinking Level · High	88.10Thinking Enabled
HLE 综合评估	41.40Thinking Level · High	42.70Thinking Level · High ｜ Tools
FrontierMath - Tier 4 数学推理	35.40Thinking Level · Extra High	12.50Thinking Level · High ｜ Tools

API price comparison

Side-by-side input/output token pricing

Detailed feature breakdown

Licensing, MoE architecture, and multi-modality support.

Features & specs	GPT-5.5OpenAI	GPT-5.1OpenAI
Core specsRelease	2026-04-23	2025-11-12
Context length	1000K	400K
Max output	131072	131072
MoE	No	No
Supported modes	No mode data	常规模式（Non-Thinking Mode）思考模式（Thinking Mode）
LicenseCode Open Source	Not provided	Not provided
Weights Open Source	Not provided	Not provided
Commercial use	不开源	不开源
Modality supportText Input/Output	/	/
Image Input/Output	/	/
ResourcesPaper / report	Introducing GPT‑5.5	GPT-5.1: A smarter, more conversational ChatGPT
DataLearner blog	OpenAI 发布 GPT-5.5：代号	OpenAI发布GPT-5.1：围绕“对话体验、一致性、任务适配性”进行的系统化优化的小幅更新！