Gemini 3.5 FlashvsGemini 2.5 Flash
Across 3 shared benchmarks, Gemini 3.5 Flash leads overall: Gemini 3.5 Flash wins 3, Gemini 2.5 Flash wins 0, with 0 ties and an average score difference of +30.66.
Google Deep Mind · 2026-06-20 · Multimodal model
Google Deep Mind · 2025-04-17 · Reasoning model
Benchmark scores
Grouped by capability, sorted by largest gap within each. 3 shared benchmarks.
General Knowledge
Gemini 3.5 Flash 2/2| Benchmark | Gemini 3.5 Flash | Gemini 2.5 Flash | Diff |
|---|---|---|---|
| HLE | 40.2053 / 159Thinking High (With Tools) | 11129 / 159 | +29.20 |
| LiveBench | 75.0217 / 115Thinking High (No Tools) | 47.74101 / 115Thinking High (No Tools) | +27.28 |
Math and Reasoning
Gemini 3.5 Flash 1/1| Benchmark | Gemini 3.5 Flash | Gemini 2.5 Flash | Diff |
|---|---|---|---|
| Simple Bench | 76.704 / 63Normal (No Tools) | 41.2037 / 63Normal (No Tools) | +35.50 |
Specs
| Field | Gemini 3.5 Flash | Gemini 2.5 Flash |
|---|---|---|
| Publisher | Google Deep Mind | Google Deep Mind |
| Release date | 2026-06-20 | 2025-04-17 |
| Model type | Multimodal model | Reasoning model |
| Architecture | Dense | Dense |
| Parameters | Not available | Not available |
| Context length | 1M | 1000K |
| Max output | 64K | 64K |
API pricing
Prices use DataLearner records when available; missing fields are not inferred.
| Item | Gemini 3.5 Flash | Gemini 2.5 Flash |
|---|---|---|
| Text input | $1.5 / 1M tokens | Not public |
| Text output | $9 / 1M tokens | Not public |
One or both models have incomplete public pricing.
Summary
- Gemini 3.5 Flashleads in:General Knowledge (2/2), Math and Reasoning (1/1)
On average across the 3 shared benchmarks, Gemini 3.5 Flash scores 30.66 higher.
Largest single-benchmark gap: Simple Bench — Gemini 3.5 Flash 76.70 vs Gemini 2.5 Flash 41.20 (+35.50).
Page generated from structured model, pricing and benchmark records. No real-time LLM is used to write the prose.