加载中...

GPT-5.4

Name: GPT-5.4
Author: OpenAI

Release date: 2026-03-05更新于: 2026-03-10 08:31:15知识截止: 2025-08520

Live demoGitHubHugging FaceCompare

Parameters

Not disclosed

Context length

Chinese support

Supported

Reasoning ability

Data sourced primarily from official releases (GitHub, Hugging Face, papers), then benchmark leaderboards, then third-party evaluators. Learn about our data methodology

GPT-5.4

Model basics

Reasoning traces

Supported

Thinking modes

Thinking Level · Extra-High (Default)Thinking Level · LowThinking Level · MediumThinking Level · High

Context length

1M tokens

Max output length

128000 tokens

Model type

多模态大模型

Release date

2026-03-05

Model file size

No data

MoE architecture

Total params / Active params

0.0B / N/A

Knowledge cutoff

2025-08

GPT-5.4

Open source & experience

Code license

不开源

Weights license

不开源- 不开源

GitHub repo

GitHub link unavailable

Hugging Face

Hugging Face link unavailable

Live demo

https://chatgpt.com/

GPT-5.4

Official resources

Paper

Introducing GPT‑5.4

DataLearnerAI blog

No blog post yet

GPT-5.4

API details

API speed

3/5

💡Default unit: $/1M tokens. If vendors use other units, follow their published pricing.

Learn about pricing modes

Standard

Type	Condition	Input	Output
Text	Context <= 272K	$2.50/ 1M	$15.00/ 1M
Text	Context > 272K	$5.00/ 1M	$22.50/ 1M

Cache PricingPrompt Cache

Type	TTL	Write	Read
Text	5m	$0.250/ 1M	-

GPT-5.4

Benchmark Results

GPT-5.4 currently shows benchmark results led by HLE (3 / 113, score 52.10), GPQA Diamond (6 / 160, score 92.80), SWE-Bench Pro - Public (1 / 19, score 57.70). This page also consolidates core specs, context limits, and API pricing so you can evaluate the model from benchmark results and deployment constraints together.

综合评估

1 evaluations

Benchmark / mode

Score

Rank/total

HLE

Extra-HighWith tools

52.10

3 / 113

Agent能力评测

3 evaluations

Benchmark / mode

Score

Rank/total

τ²-Bench - Telecom

OffWith tools

64.30

28 / 33

τ²-Bench - Telecom

Extra-HighWith tools

98.90

3 / 33

Pinch Bench

HighWith tools

86.40

2 / 5

AI Agent - 信息收集

1 evaluations

Benchmark / mode

Score

Rank/total

BrowseComp

Extra-HighWith tools

82.70

6 / 32

AI Agent - 工具使用

2 evaluations

Benchmark / mode

Score

Rank/total

Terminal Bench 2.0

Extra-HighWith tools

75.10

2 / 26

OSWorld-Verified

Extra-HighWith tools

1 / 10

View benchmark analysis Compare with other models

GPT-5.4

Publisher

OpenAI

View publisher details

GPT-5.4

Model Overview

GPT-5.4模型概述

GPT-5.4 是 OpenAI 于 2026 年 3 月发布的 GPT 系列大型语言模型版本之一。该模型定位为面向专业工作场景的前沿基础模型，旨在提升复杂知识任务、软件开发以及长上下文分析等场景中的能力与效率。公开报道指出，GPT-5.4 在多个能力维度上进行了优化，包括上下文处理能力、推理效率以及复杂任务完成能力。

在产品定位上，GPT-5.4 属于 GPT-5 系列的迭代版本，并在发布时提供多个不同配置的变体，包括面向推理任务的 GPT-5.4 Thinking 版本以及面向企业级生产环境的 GPT-5.4 Pro 版本。

架构与技术规格

截至目前，OpenAI 尚未公开 GPT-5.4 的完整模型架构细节，例如参数规模或训练数据规模。与 GPT 系列的其他商业模型类似，该模型采用专有架构并未开源。

公开资料表明，该模型在 API 版本中支持最高约 1,000,000 tokens 的上下文窗口，这使其能够处理远超传统语言模型的长文档输入，例如大型代码库、研究文献或长周期任务上下文。

社区分析及部分技术报道还指出，开发过程中的代码提交信息曾显示更大规模上下文窗口的实验版本，相关讨论中提到最高可能达到约 2M tokens 的实验能力，但该信息并未在官方发布文档中确认。

核心能力与支持模态

GPT-5.4 是一类多模态基础模型，能够处理文本和图像等多种输入形式。相关技术更新提到，该模型支持更高精度的图像输入处理能力，例如在 API 中支持直接传入原始图像字节进行分析，这对于需要高分辨率视觉理解的应用场景具有重要意义。

在文本能力方面，该模型针对复杂知识工作任务进行了优化，例如多步骤推理、软件开发辅助、信息检索与整合等场景。

性能与基准评测

公开报道显示，GPT-5.4 在多个内部评测任务中取得了较高表现。例如在面向知识工作任务的 GDPval 评测中取得约 83% 的成绩，并在 OSWorld-Verified 和 WebArena Verified 等计算机使用相关评测中取得新的记录成绩。

此外，官方资料指出该模型在事实性错误方面相比 GPT-5.2 有明显下降，错误率降低约 33%。

应用场景

由于其长上下文能力和多模态能力，GPT-5.4 被用于多种复杂应用场景，例如：

大型代码库理解与软件工程辅助
复杂知识工作任务与研究分析
长文档处理与信息总结
自动化任务与智能代理系统

访问方式与许可

GPT-5.4 为专有商业模型，目前主要通过 OpenAI 提供的 API 以及相关开发工具平台访问。模型权重与训练数据未公开发布。

DataLearner 官方微信

欢迎关注 DataLearner 官方微信，获得最新 AI 技术推送

加载中...

GPT-5.4

Release date: 2026-03-05更新于: 2026-03-10 08:31:15知识截止: 2025-08520

Live demoGitHubHugging FaceCompare

Parameters

Not disclosed

Context length

Chinese support

Supported

Reasoning ability

Data sourced primarily from official releases (GitHub, Hugging Face, papers), then benchmark leaderboards, then third-party evaluators. Learn about our data methodology

GPT-5.4

Model basics

Reasoning traces

Supported

Thinking modes

Thinking Level · Extra-High (Default)Thinking Level · LowThinking Level · MediumThinking Level · High

Context length

1M tokens

Max output length

128000 tokens

Model type

多模态大模型

Release date

2026-03-05

Model file size

No data

MoE architecture

Total params / Active params

0.0B / N/A

Knowledge cutoff

2025-08

GPT-5.4

Open source & experience

Code license

不开源

Weights license

不开源- 不开源

GitHub repo

GitHub link unavailable

Hugging Face

Hugging Face link unavailable

Live demo

https://chatgpt.com/

GPT-5.4

Official resources

Paper

Introducing GPT‑5.4

DataLearnerAI blog

No blog post yet

GPT-5.4

API details

API speed

3/5

💡Default unit: $/1M tokens. If vendors use other units, follow their published pricing.

Learn about pricing modes

Standard

Type	Condition	Input	Output
Text	Context <= 272K	$2.50/ 1M	$15.00/ 1M
Text	Context > 272K	$5.00/ 1M	$22.50/ 1M

Cache PricingPrompt Cache

Type	TTL	Write	Read
Text	5m	$0.250/ 1M	-

GPT-5.4

Benchmark Results

综合评估

1 evaluations

Benchmark / mode

Score

Rank/total

HLE

Extra-HighWith tools

52.10

3 / 113

Agent能力评测

3 evaluations

Benchmark / mode

Score

Rank/total

τ²-Bench - Telecom

OffWith tools

64.30

28 / 33

τ²-Bench - Telecom

Extra-HighWith tools

98.90

3 / 33

Pinch Bench

HighWith tools

86.40

2 / 5

AI Agent - 信息收集

1 evaluations

Benchmark / mode

Score

Rank/total

BrowseComp

Extra-HighWith tools

82.70

6 / 32

AI Agent - 工具使用

2 evaluations

Benchmark / mode

Score

Rank/total

Terminal Bench 2.0

Extra-HighWith tools

75.10

2 / 26

OSWorld-Verified

Extra-HighWith tools

1 / 10

View benchmark analysis Compare with other models

GPT-5.4

Publisher

OpenAI

View publisher details

GPT-5.4

Model Overview

GPT-5.4模型概述

架构与技术规格

核心能力与支持模态

在文本能力方面，该模型针对复杂知识工作任务进行了优化，例如多步骤推理、软件开发辅助、信息检索与整合等场景。

性能与基准评测

此外，官方资料指出该模型在事实性错误方面相比 GPT-5.2 有明显下降，错误率降低约 33%。

应用场景

由于其长上下文能力和多模态能力，GPT-5.4 被用于多种复杂应用场景，例如：

大型代码库理解与软件工程辅助
复杂知识工作任务与研究分析
长文档处理与信息总结
自动化任务与智能代理系统

访问方式与许可

GPT-5.4 为专有商业模型，目前主要通过 OpenAI 提供的 API 以及相关开发工具平台访问。模型权重与训练数据未公开发布。

DataLearner 官方微信

欢迎关注 DataLearner 官方微信，获得最新 AI 技术推送