GPT-5.4 minivsGPT-5-mini
Across 3 shared benchmarks, GPT-5.4 mini leads overall: GPT-5.4 mini wins 2, GPT-5-mini wins 1, with 0 ties and an average score difference of +17.10.
GPT-5.4 mini
OpenAI · 2026-03-17 · Reasoning model
GPT-5-mini
OpenAI · 2025-08-07 · Foundation model
GPT-5.4 mini2 wins(67%)(33%)1 winGPT-5-mini
Benchmark scores
Grouped by capability, sorted by largest gap within each. 3 shared benchmarks.
General Knowledge
GPT-5.4 mini 2/2| Benchmark | GPT-5.4 mini | GPT-5-mini | Diff |
|---|---|---|---|
| HLE | 41.5046 / 157极高强度思考(工具) | 5153 / 157 | +36.50 |
| GPQA Diamond | 8832 / 178极高强度思考(无工具) | 69118 / 178 | +19 |
Math and Reasoning
GPT-5-mini 1/1| Benchmark | GPT-5.4 mini | GPT-5-mini | Diff |
|---|---|---|---|
| FrontierMath - Tier 4 | 2.1056 / 80Thinking High (No Tools) | 6.3035 / 80Thinking High (No Tools) | -4.20 |
Specs
| Field | GPT-5.4 mini | GPT-5-mini |
|---|---|---|
| Publisher | OpenAI | OpenAI |
| Release date | 2026-03-17 | 2025-08-07 |
| Model type | Reasoning model | Foundation model |
| Architecture | Dense | Dense |
| Parameters | Not available | Not available |
| Context length | 400K | 400K |
| Max output | 128K | 128K |
API pricing
Prices use DataLearner records when available; missing fields are not inferred.
| Item | GPT-5.4 mini | GPT-5-mini |
|---|---|---|
| Text input | $0.75 / 1M tokens | Not public |
| Text output | $4.5 / 1M tokens | Not public |
| Cache read | $4.5 / 1M tokens | Not public |
| Cache write | $0.075 / 1M tokens | Not public |
One or both models have incomplete public pricing.
Summary
- GPT-5.4 minileads in:General Knowledge (2/2)
- GPT-5-minileads in:Math and Reasoning (1/1)
On average across the 3 shared benchmarks, GPT-5.4 mini scores 17.10 higher.
Largest single-benchmark gap: HLE — GPT-5.4 mini 41.50 vs GPT-5-mini 5 (+36.50).
Page generated from structured model, pricing and benchmark records. No real-time LLM is used to write the prose.