GPT-5.4 mini vs GPT-5-mini

Across 3 shared benchmarks, GPT-5.4 mini leads overall: it wins 2, GPT-5-mini wins 1, and there are 0 ties. The average score difference is +17.10 in GPT-5.4 mini's favor.

GPT-5.4 mini

OpenAI · 2026-03-17 · Reasoning model

GPT-5-mini

OpenAI · 2025-08-07 · Foundation model

GPT-5.4 mini: 2 wins (67%) · GPT-5-mini: 1 win (33%)

Benchmark scores

Grouped by capability, sorted by largest gap within each. 3 shared benchmarks.

General Knowledge

GPT-5.4 mini 2/2
Benchmark | GPT-5.4 mini | GPT-5-mini | Diff
HLE | 41.50 (46 / 157, Thinking Extra High, With Tools) | 5 (153 / 157) | +36.50
GPQA Diamond | 88 (32 / 178, Thinking Extra High, No Tools) | 69 (118 / 178) | +19

Math and Reasoning

GPT-5-mini 1/1
Benchmark | GPT-5.4 mini | GPT-5-mini | Diff
FrontierMath - Tier 4 | 2.10 (56 / 80, Thinking High, No Tools) | 6.30 (35 / 80, Thinking High, No Tools) | -4.20

Specs

Field | GPT-5.4 mini | GPT-5-mini
Publisher | OpenAI | OpenAI
Release date | 2026-03-17 | 2025-08-07
Model type | Reasoning model | Foundation model
Architecture | Dense | Dense
Parameters | Not available | Not available
Context length | 400K | 400K
Max output | 128K | 128K

API pricing

Prices use DataLearner records when available; missing fields are not inferred.

Item | GPT-5.4 mini | GPT-5-mini
Text input | $0.75 / 1M tokens | Not public
Text output | $4.5 / 1M tokens | Not public
Cache read | $4.5 / 1M tokens | Not public
Cache write | $0.075 / 1M tokens | Not public

One or both models have incomplete public pricing.

Summary

  • GPT-5.4 mini leads in: General Knowledge (2/2)
  • GPT-5-mini leads in: Math and Reasoning (1/1)

On average across the 3 shared benchmarks, GPT-5.4 mini scores 17.10 higher.

Largest single-benchmark gap: HLE — GPT-5.4 mini 41.50 vs GPT-5-mini 5 (+36.50).
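
The summary figures can be checked by hand from the three benchmark rows. The sketch below is illustrative only: the names `SCORES` and `summarize` are hypothetical, and it assumes the average is a plain mean of the per-benchmark differences (GPT-5.4 mini minus GPT-5-mini). The page does not document how its figures are computed, but this assumption reproduces them.

```python
# Scores copied from the benchmark tables on this page.
# Each row: (benchmark, GPT-5.4 mini score, GPT-5-mini score)
SCORES = [
    ("HLE", 41.50, 5.0),
    ("GPQA Diamond", 88.0, 69.0),
    ("FrontierMath - Tier 4", 2.10, 6.30),
]


def summarize(rows):
    """Hypothetical helper: win counts, mean difference, and largest gap."""
    # A positive difference means GPT-5.4 mini scored higher.
    diffs = {name: round(a - b, 2) for name, a, b in rows}
    return {
        "wins_a": sum(d > 0 for d in diffs.values()),
        "wins_b": sum(d < 0 for d in diffs.values()),
        "ties": sum(d == 0 for d in diffs.values()),
        # Assumption: the page's average is a plain mean of the differences.
        "avg_diff": round(sum(diffs.values()) / len(diffs), 2),
        "largest_gap": max(diffs, key=lambda name: abs(diffs[name])),
    }


summary = summarize(SCORES)
```

With the scores above, `summary` reports 2 wins for GPT-5.4 mini, 1 win for GPT-5-mini, 0 ties, an average difference of 17.1, and HLE as the largest gap, which matches the figures stated on this page.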

Page generated from structured model, pricing and benchmark records. No real-time LLM is used to write the prose.