DataLearner logoDataLearnerAI
Latest AI Insights
Model Leaderboards
Benchmarks
Model Directory
Model Comparison
Resource Center
Tools
LanguageEnglish
DataLearner logoDataLearner AI

A knowledge platform focused on LLM benchmarking, datasets, and practical instruction with continuously updated capability maps.

Products

  • Leaderboards
  • Model comparison
  • Datasets

Resources

  • Tutorials
  • Editorial
  • Tool directory

Company

  • About
  • Privacy policy
  • Data methodology
  • Contact

© 2026 DataLearner AI. DataLearner curates industry data and case studies so researchers, enterprises, and developers can rely on trustworthy intelligence.

Privacy policyTerms of service
HomeModel CompareGPT-5.5 vs GPT-5.1 评测对比

GPT-5.5 vs GPT-5.1 评测对比

See key specs and per-benchmark scores for each model/mode. Scroll horizontally for all columns. 当前对比 2 个模型的评测数据与核心参数。

OpenAI

GPT-5.5

OpenAI

Release
2026-04-23
Context length
1000K
Parameters
Not provided
最大输出
131,072 tokens
Model profile·Playground
OpenAI

GPT-5.1

OpenAI

Release
2025-11-12
Context length
400K
Parameters
Not provided
最大输出
131,072 tokens
支持模态
常规模式(Non-Thinking Mode) · 思考模式(Thinking Mode)
Model profile·Playground
Loading comparison...

Best overall

GPT-5.5 · 70.08

Best single

GPT-5.5 · ARC-AGI 95.00

Modality coverage

GPT-5.5 · 2 modalities

Head to head

GPT-5.5
4
1
GPT-5.1
AheadTiedBehind

5

Benchmarks

4

Wins

1

Losses

+23.34

Average diff

Performance benchmarks

Compare benchmark results across thinking modes and tool usage.

Data sourced primarily from official releases (GitHub, Hugging Face, papers), then benchmark leaderboards, then third-party evaluators. Learn about our data methodology

Thinking
Tool usage
Internet
Filter: Best Available·2 modes · 5 Benchmark
图表加载中...

Benchmark score table

Complete scores for each model/mode across selected benchmarks.

5 benchmarks with comparable scores. Each model shows its best score; mode label is displayed below.

BenchmarkGPT-5.5GPT-5.1
ARC-AGI
综合评估
95.00Thinking Level · Extra High
72.80Thinking Level · High
ARC-AGI-2
综合评估
85.00Thinking Level · Extra High
17.60Thinking Level · High
GPQA Diamond
综合评估
93.60Thinking Level · High
88.10Thinking Enabled
HLE
综合评估
41.40Thinking Level · High
42.70Thinking Level · High | Tools
FrontierMath - Tier 4
数学推理
35.40Thinking Level · Extra High
12.50Thinking Level · High | Tools

API price comparison

Side-by-side input/output token pricing

Detailed feature breakdown

Licensing, MoE architecture, and multi-modality support.

Features & specs
GPT-5.5OpenAI
GPT-5.1OpenAI
Core specsRelease
2026-04-232025-11-12
Context length
1000K400K
Max output
131072131072
MoE
NoNo
Supported modes
No mode data
常规模式(Non-Thinking Mode)思考模式(Thinking Mode)
LicenseCode Open Source
Not providedNot provided
Weights Open Source
Not providedNot provided
Commercial use
不开源不开源
Modality supportText Input/Output
/
/
Image Input/Output
/
/
ResourcesPaper / report
Introducing GPT‑5.5GPT-5.1: A smarter, more conversational ChatGPT
DataLearner blog
OpenAI 发布 GPT-5.5:代号OpenAI发布GPT-5.1:围绕“对话体验、一致性、任务适配性”进行的系统化优化的小幅更新!