DataLearner logoDataLearnerAI
Latest AI Insights
Model Leaderboards
Benchmarks
Model Directory
Model Comparison
Resource Center
Tools
LanguageEnglish
DataLearner logoDataLearner AI

A knowledge platform focused on LLM benchmarking, datasets, and practical instruction with continuously updated capability maps.

Products

  • Leaderboards
  • Model comparison
  • Datasets

Resources

  • Tutorials
  • Editorial
  • Tool directory

Company

  • About
  • Privacy policy
  • Data methodology
  • Contact

© 2026 DataLearner AI. DataLearner curates industry data and case studies so researchers, enterprises, and developers can rely on trustworthy intelligence.

Privacy policyTerms of service
Page navigation
Page navigation
Model catalogGPT-5.5
GP

GPT-5.5

Reasoning modelGPT-5.5

GPT-5.5

Release date: 2026-04-23Updated: 2026-05-02 13:16:40.7245,853
Live demoGitHubHugging FaceCompare
Parameters
Not disclosed
Context length
1000K
Chinese support
Supported
Reasoning ability

GPT-5.5(代号 Spud)是 OpenAI 于 2026 年 4 月发布的旗舰推理模型,专为 Agent 编程、计算机操控与知识工作设计,支持 100 万 token 上下文。本页收录完整基准评测、API 定价与模型解读。

Data sourced primarily from official releases (GitHub, Hugging Face, papers), then benchmark leaderboards, then third-party evaluators. Learn about our data methodology

GPT-5.5

Model basics

Reasoning traces
Supported
Thinking modes
Thinking Level · Extra-High (Default)Standard ModeThinking Level · LowThinking Level · MediumThinking Level · HighThinking Level · Max
Context length
1000K tokens
Max output length
128K tokens
Model type
Reasoning model
Release date
2026-04-23
Model file size
No data
MoE architecture
No
Total params / Active params
No data / N/A
Knowledge cutoff
No data
GPT-5.5

Open source & experience

Code license
不开源
Weights license
不开源
GitHub repo
GitHub link unavailable
Hugging Face
Hugging Face link unavailable
Live demo
https://chatgpt.com/
GPT-5.5

Official resources

Paper
Introducing GPT‑5.5
DataLearnerAI blog
DataLearnerAI blog
GPT-5.5

API details

API speed
3/5
💡Default unit: $/1M tokens. If vendors use other units, follow their published pricing.
Learn about pricing modes
Standard
TypeConditionInputOutput
Text-$5.00/ 1M$30.00/ 1M
Batch
TypeConditionInputOutput
Text-$2.50/ 1M$15.00/ 1M
Turbo
TypeConditionInputOutput
Text-$12.50/ 1M$75.00/ 1M
Cache PricingPrompt Cache
TypeTTLWriteRead
Text--$0.500/ 1M
Text5m$6.25/ 1M$0.500/ 1M
Text1h$6.25/ 1M$0.500/ 1M
GPT-5.5

Benchmark Results

GPT-5.5 currently shows benchmark results led by ARC-AGI-2 (1 / 59, score 85), Terminal Bench 2.0 (1 / 46, score 82.70), FrontierMath (2 / 60, score 51.70). This page also consolidates core specs, context limits, and API pricing so you can evaluate the model from benchmark results and deployment constraints together.

Thinking
Tool usage
Internet

General Knowledge

12 evaluations
Benchmark / mode
Score
Rank/total
ARC-AGI
Low
76.20
23 / 65
ARC-AGI
Medium
92.20
10 / 65
ARC-AGI
High
94.50
5 / 65
ARC-AGI
Extra-High
95
3 / 65
GPQA Diamond
High
93.60
6 / 177
ARC-AGI-2
Low
33.30
28 / 59
ARC-AGI-2
Medium
70.40
12 / 59
ARC-AGI-2
High
83.30
5 / 59
ARC-AGI-2
Extra-High
85
1 / 59
HLE
High
41.40
47 / 157
HLE
HighTools
52.20
13 / 157
ARC-AGI-3
High
0
2 / 6

Math and Reasoning

3 evaluations
Benchmark / mode
Score
Rank/total
FrontierMath
HighTools
51.70
2 / 60
FrontierMath - Tier 4
HighTools
35.40
7 / 80
FrontierMath - Tier 4
Extra-High
35.40
7 / 80

Coding and Software Engineer

1 evaluations
Benchmark / mode
Score
Rank/total
SWE-Bench Pro - Public
HighTools
58.60
7 / 43

Agent Level Benchmark

1 evaluations
Benchmark / mode
Score
Rank/total
τ²-Bench - Telecom
HighTools
98
5 / 35

AI Agent - Information Search

1 evaluations
Benchmark / mode
Score
Rank/total
BrowseComp
HighToolsInternet
84.40
5 / 44

AI Agent - Tool Usage

2 evaluations
Benchmark / mode
Score
Rank/total
Terminal Bench 2.0
HighTools
82.70
1 / 46
OSWorld-Verified
HighTools
78.70
5 / 19

Productivity Knowledge

1 evaluations
Benchmark / mode
Score
Rank/total
GDPval-AA
High
1769
2 / 21
View benchmark analysisCompare with other models
GPT-5.5

Publisher

OpenAI
OpenAI
View publisher details
GPT-5.5

Model Overview

GPT-5.5(代号 Spud)是 OpenAI 于 2026 年 4 月发布的旗舰推理模型,专为 Agent 编程、计算机操控与知识工作设计,支持 100 万 token 上下文。本页收录完整基准评测、API 定价与模型解读。

DataLearner on WeChat

Follow DataLearner on WeChat for AI model updates and research notes.

DataLearner WeChat QR code

Compare with other models

  • Earlier versionGPT-5.5 vs GPT-5.412 benchmarks
  • Peer modelGPT-5.5 vs Opus 4.711 benchmarks
  • Peer modelGPT-5.5 vs Gemini 3.1 Pro Preview10 benchmarks
  • Earlier versionGPT-5.5 vs GPT-5.210 benchmarks
  • Earlier versionGPT-5.5 vs GPT-5.110 benchmarks
  • Peer modelGPT-5.5 vs Claude Mythos Preview6 benchmarks

Want a custom combination? Open the compare tool