DataLearnerAI

A knowledge platform focused on LLM benchmarking, datasets, and practical instruction with continuously updated capability maps.

© 2026 DataLearner AI. DataLearner curates industry data and case studies so researchers, enterprises, and developers can rely on trustworthy intelligence.

OpenAI o3

Reasoning model

Release date: 2025-04-16
Updated: 2025-08-08 14:11:49
Parameters
Not disclosed
Context length
200K
Chinese support
Supported
Reasoning ability

OpenAI o3 is a reasoning model published by OpenAI and released on 2025-04-16. Its parameter count is not disclosed, its context length is 200K tokens, and it is closed source.

Data is sourced primarily from official releases (GitHub, Hugging Face, papers), then from benchmark leaderboards, then from third-party evaluators.


Model basics

Reasoning traces
Supported
Thinking modes
Not supported
Context length
200K tokens
Max output length
100,000 tokens
Model type
Reasoning model
Release date
2025-04-16
Model file size
No data
MoE architecture
No
Total params / Active params
Not disclosed / N/A
Knowledge cutoff
No data
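
The context length and max output length above together bound how much a single request can generate. The following is a minimal sketch, assuming that the context window is shared between prompt and generated tokens (this page does not say so) and reading "200K" as 200,000. The function name `max_allowed_output` is hypothetical.

```python
CONTEXT_LENGTH = 200_000     # "200K tokens", read here as 200,000 (assumption)
MAX_OUTPUT_TOKENS = 100_000  # "Max output length" listed above


def max_allowed_output(prompt_tokens: int) -> int:
    """Largest output budget that fits, assuming prompt and output share the window."""
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    remaining = max(CONTEXT_LENGTH - prompt_tokens, 0)
    return min(remaining, MAX_OUTPUT_TOKENS)


# Hypothetical prompt sizes, for illustration only.
print(max_allowed_output(50_000))
print(max_allowed_output(150_000))
```

Under this assumption, a short prompt is limited by the output cap, while a long prompt is limited by whatever remains of the context window.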

Open source & experience

Code license
Closed source
Weights license
Closed source
GitHub repo
GitHub link unavailable
Hugging Face
Hugging Face link unavailable
Live demo
https://chatgpt.com/

Official resources

Paper
Introducing OpenAI o3 and o4-mini
DataLearnerAI blog
No blog post yet

API details

API speed
1/5
💡Default unit: $/1M tokens. If vendors use other units, follow their published pricing.
Standard pricing
Modality   Input   Output
Text       $10     $40
Image      $10     --
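
To make the $/1M-token unit concrete, the sketch below estimates the cost of one text request from the standard prices listed above ($10 input, $40 output). The function name `estimate_cost_usd` and the token counts are hypothetical and chosen only for illustration. Vendor-specific billing rules and discounts are not modeled.

```python
# Standard text prices listed above, in USD per 1M tokens.
INPUT_PRICE_PER_M = 10.0
OUTPUT_PRICE_PER_M = 40.0


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one text request at the listed standard prices."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


# Hypothetical request: 20,000 input tokens and 5,000 output tokens.
print(f"${estimate_cost_usd(20_000, 5_000):.2f}")  # prints "$0.40"
```

Because output is priced at four times the input rate, a long response can dominate the bill even when the prompt is much larger than the answer.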

Benchmark Results

OpenAI o3 currently shows benchmark results led by Creative Writing (2 / 22, score 87.65), Aider-Polyglot (3 / 26, score 81.30), MATH-500 (5 / 43, score 98.10). This page also consolidates core specs, context limits, and API pricing so you can evaluate the model from benchmark results and deployment constraints together.

Thinking modes (4): Default (Medium), Thinking Mode, High, Low

Mathematical reasoning

1 evaluation
Benchmark      Mode     Score   Rank / total
FrontierMath   Medium   10      23 / 55

Publisher

OpenAI

Model Overview

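OpenAI o3 is currently OpenAI's most advanced reasoning model. As the flagship of the o-series, it sets a new performance bar in complex problem solving, cross-domain analysis, and visual reasoning, and it is especially strong at multi-step workflows that require deep logical deduction.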

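Core features

  1. Multimodal reasoning. For the first time, o3 builds a joint chain of thought over images and text:
     • Parses low-quality visual input such as whiteboard sketches and textbook diagrams
     • Processes images dynamically (real-time rotation, zooming, and coordinate transformation)
     • Reaches 86.8% accuracy on the MMMU college-level visual problem-solving benchmark, 21% higher than its predecessor
  2. Autonomous tool orchestration. o3 can compose complex tool combinations on its own:

     ```python
     # Typical workflow example: energy demand forecasting (illustrative pseudocode;
     # these function names are placeholders, not a real API)
     web_search("California energy data for last summer")  # then analyze the web results
     generate_python_code("build a forecasting model")      # then run the code and visualize
     create_explanatory_diagram()
     ```

     It supports multi-round search iteration and dynamic strategy adjustment, with an average problem-solving time under 60 seconds.
  3. Cross-disciplinary reasoning:
     • Codeforces competitive-programming Elo rating of 2706, above the average for professional competitors
     • 69.1% accuracy on SWE-bench software engineering tasks, without custom scaffolding
     • Hypothesis generation and verification in biology and mathematics, recognized by domain experts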

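Technical innovations

  • Compute scaling validated: scaling training compute by an order of magnitude (10x) confirms that reasoning performance keeps improving as compute increases
  • Reinforcement learning for tool use: the model is trained to decide for itself when to use tools, improving its handling of open-ended scenarios by 37%
  • Memory and context optimization: supports referencing knowledge across conversations, improving the relevance of personalized responses by 28%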

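Performance

Benchmark                             o1      o3 (no tools)   o3 (all tools)
AIME 2025 math competition            79.2%   88.9%           98.4%
PhD-level science questions (GPQA)    8.12%   20.32%          24.90%
Visual math reasoning (MathVista)     55.1%   78.6%           -
Code editing tasks (Aider)            64.4%   81.3%           -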
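
As a quick reading aid, the sketch below computes the percentage-point gain of o3 (no tools) over o1 for each row of the table above. The scores are copied from the table as listed. The dictionary and variable names are illustrative only.

```python
# (o1 score, o3 no-tools score) in percent, copied from the table above.
scores = {
    "AIME 2025": (79.2, 88.9),
    "GPQA": (8.12, 20.32),
    "MathVista": (55.1, 78.6),
    "Aider": (64.4, 81.3),
}

# Percentage-point gain of o3 (no tools) over o1, rounded to 2 decimals
# to hide floating-point noise from the subtraction.
gains = {name: round(o3 - o1, 2) for name, (o1, o3) in scores.items()}

for name, gain in gains.items():
    print(f"{name}: +{gain} points")
```

These are absolute differences in percentage points, not relative improvements, so they are not directly comparable to figures such as "42% higher" quoted elsewhere on this page.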

At the same latency, o3 reasons 3.2x deeper than o1, and its success rate on complex problems is 42% higher.

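Safety framework

  • Risk-category training: adds 12 categories of targeted refusal policies, including biological threats and jailbreak attacks
  • Interpretable monitoring framework: an LLM monitor built from human-readable safety specifications identifies 99% of biorisk conversations
  • Three-track evaluation: assessed for biological and chemical, cybersecurity, and AI self-improvement risks, with every metric below the "High" risk threshold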

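Deployment

  • ChatGPT: Enterprise and Edu plans get priority access, with support for joint analysis of multiple files and generation of visual reports
  • API service: the Responses API preserves intermediate reasoning state, which makes function calling more stable
  • Research support: a customized interface for analyzing reasoning traces is available, and applications for academic use are accepted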
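
The Responses API point above can be sketched as follows. This is a minimal sketch rather than official usage: it only builds the keyword arguments for a follow-up request, assuming the `openai` Python SDK's `client.responses.create` with its `previous_response_id`, `reasoning`, and `max_output_tokens` parameters, and assuming the model identifier is `o3`. The helper name `build_followup_request` and the response ID `resp_123` are hypothetical.

```python
from typing import Any

MAX_OUTPUT_TOKENS = 100_000  # max output length listed on this page


def build_followup_request(
    previous_response_id: str,
    user_text: str,
    effort: str = "medium",
    max_output_tokens: int = 4_096,
) -> dict[str, Any]:
    """Build kwargs for a follow-up turn that chains onto an earlier response."""
    if effort not in {"low", "medium", "high"}:
        raise ValueError("effort must be 'low', 'medium', or 'high'")
    if not 0 < max_output_tokens <= MAX_OUTPUT_TOKENS:
        raise ValueError("max_output_tokens is out of range")
    return {
        "model": "o3",  # assumed model identifier
        "input": user_text,
        # Chaining to the earlier response is what lets the service reuse
        # the state of the previous turn instead of starting from scratch.
        "previous_response_id": previous_response_id,
        "reasoning": {"effort": effort},
        "max_output_tokens": max_output_tokens,
    }


request = build_followup_request("resp_123", "Now plot the forecast.")
print(request["previous_response_id"])

# With the SDK installed and an API key configured, the call would look like:
#   from openai import OpenAI
#   response = OpenAI().responses.create(**request)
```

Keeping the payload construction separate from the network call makes the request easy to inspect and validate before anything is sent.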

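This model marks an important breakthrough for AI systems toward autonomous tool scheduling and cross-modal reasoning, and it provides a new technical foundation for complex decision-making scenarios.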
