GPT-5.4 mini vs Gemini 3.0 Flash
Across 5 shared benchmarks, Gemini 3.0 Flash leads overall: GPT-5.4 mini wins 1, Gemini 3.0 Flash wins 4, with 0 ties and an average score difference of -0.90.
GPT-5.4 mini
OpenAI · 2026-03-17 · Reasoning model
Gemini 3.0 Flash
Google DeepMind · 2025-12-17 · AI model
GPT-5.4 mini: 1 win (20%) · Gemini 3.0 Flash: 4 wins (80%)
Benchmark scores
Grouped by capability, sorted by largest gap within each. 5 shared benchmarks.
General Knowledge
Gemini 3.0 Flash 2/2
| Benchmark | GPT-5.4 mini | Gemini 3.0 Flash | Diff |
|---|---|---|---|
| GPQA Diamond | 88 (rank 29 / 175; extra-high thinking effort, no tools) | 90.40 (rank 15 / 175; thinking) | -2.40 |
| HLE | 41.50 (rank 41 / 149; extra-high thinking effort, with tools) | 43.50 (rank 33 / 149; thinking + tool use) | -2 |
AI Agent - Tool Usage
GPT-5.4 mini 1/1
| Benchmark | GPT-5.4 mini | Gemini 3.0 Flash | Diff |
|---|---|---|---|
| Terminal Bench 2.0 | 60 | 47.60 | +12.40 |
Specs
| Field | GPT-5.4 mini | Gemini 3.0 Flash |
|---|---|---|
| Publisher | OpenAI | Google DeepMind |
| Release date | 2026-03-17 | 2025-12-17 |
| Model type | Reasoning model | AI model |
| Architecture | Dense | Dense |
| Parameters | Not public | Not public |
| Context length (tokens) | 400K | 2000K |
| Max output (tokens) | 131072 | 65536 |
API pricing
Prices use DataLearner records when available; missing fields are not inferred.
| Item | GPT-5.4 mini | Gemini 3.0 Flash |
|---|---|---|
| Text input | $0.75 / 1M tokens | $0.5 / 1M tokens |
| Text output | $4.5 / 1M tokens | $3 / 1M tokens |
| Cache read | $4.5 / 1M tokens | $0.05 / 1M tokens |
| Cache write | $0.075 / 1M tokens | Not public |
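
As a rough illustration of how the per-1M-token prices above translate into a per-request cost, here is a minimal Python sketch. The `PRICES` dict, the `estimate_cost` helper, and the example token counts are illustrative assumptions rather than part of the DataLearner records, and cache pricing is deliberately left out.

```python
# Text input/output prices in USD per 1M tokens, copied from the pricing table.
# The dict layout and function name are hypothetical, for illustration only.
PRICES = {
    "GPT-5.4 mini": {"input": 0.75, "output": 4.5},
    "Gemini 3.0 Flash": {"input": 0.5, "output": 3.0},
}


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of one uncached request."""
    price = PRICES[model]
    total = input_tokens * price["input"] + output_tokens * price["output"]
    return total / 1_000_000  # prices are quoted per 1M tokens


if __name__ == "__main__":
    # Assumed workload: 200K input tokens and 20K output tokens per request.
    for model in PRICES:
        print(f"{model}: ${estimate_cost(model, 200_000, 20_000):.2f}")
```

Under that assumed workload the sketch gives about $0.24 for GPT-5.4 mini and $0.16 for Gemini 3.0 Flash. Real bills may differ once caching, tiered pricing, or non-text modalities are involved.
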
Summary
- GPT-5.4 mini leads in: AI Agent - Tool Usage (1/1)
- Gemini 3.0 Flash leads in: General Knowledge (2/2), Claw-style Agent Evaluation (1/1), Math and Reasoning (1/1)
On average across the 5 shared benchmarks, Gemini 3.0 Flash scores 0.90 higher.
Largest single-benchmark gap: Terminal Bench 2.0 — GPT-5.4 mini 60 vs Gemini 3.0 Flash 47.60 (+12.40).
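
The win tally, average difference, and largest gap reported above can be derived from per-benchmark score pairs roughly as follows. This is a sketch under stated assumptions: the `Row` type and `summarize` function are hypothetical names, the difference is taken as GPT-5.4 mini minus Gemini 3.0 Flash (consistent with the Diff column), and only the three benchmarks whose scores appear on this page are included, so the output will not reproduce the page's five-benchmark figures.

```python
from typing import NamedTuple


class Row(NamedTuple):
    benchmark: str
    gpt: float     # GPT-5.4 mini score
    gemini: float  # Gemini 3.0 Flash score


# Only the three score pairs that appear on this page; two of the five
# shared benchmarks are not listed, so this is a partial data set.
ROWS = [
    Row("GPQA Diamond", 88.0, 90.40),
    Row("HLE", 41.50, 43.50),
    Row("Terminal Bench 2.0", 60.0, 47.60),
]


def summarize(rows: list[Row]) -> dict:
    """Tally wins and ties, the mean score difference, and the widest gap."""
    diffs = [r.gpt - r.gemini for r in rows]
    widest = max(rows, key=lambda r: abs(r.gpt - r.gemini))
    return {
        "gpt_wins": sum(d > 0 for d in diffs),
        "gemini_wins": sum(d < 0 for d in diffs),
        "ties": sum(d == 0 for d in diffs),
        "avg_diff": sum(diffs) / len(diffs),
        "largest_gap": widest.benchmark,
    }


if __name__ == "__main__":
    print(summarize(ROWS))
```

On these three rows the sketch reports 1 win for GPT-5.4 mini, 2 for Gemini 3.0 Flash, and a positive average difference. The page's 1 to 4 tally and -0.90 average cover all five shared benchmarks.
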
Page generated from structured model, pricing and benchmark records. No real-time LLM is used to write the prose.