Gemini 3.5 FlashvsGemini 2.5 Flash

Across 3 shared benchmarks, Gemini 3.5 Flash leads overall: Gemini 3.5 Flash wins 3, Gemini 2.5 Flash wins 0, with 0 ties and an average score difference of +30.66.

Google Deep Mind
Gemini 3.5 Flash

Google Deep Mind · 2026-06-20 · Multimodal model

Google Deep Mind
Gemini 2.5 Flash

Google Deep Mind · 2025-04-17 · Reasoning model

Gemini 3.5 Flash3 wins(100%)(0%)0 winsGemini 2.5 Flash

Benchmark scores

Grouped by capability, sorted by largest gap within each. 3 shared benchmarks.

General Knowledge

Gemini 3.5 Flash 2/2
BenchmarkGemini 3.5 FlashGemini 2.5 FlashDiff
HLE40.2053 / 159Thinking High (With Tools)11129 / 159+29.20
LiveBench75.0217 / 115Thinking High (No Tools)47.74101 / 115Thinking High (No Tools)+27.28

Math and Reasoning

Gemini 3.5 Flash 1/1
BenchmarkGemini 3.5 FlashGemini 2.5 FlashDiff
Simple Bench76.704 / 63Normal (No Tools)41.2037 / 63Normal (No Tools)+35.50

Specs

FieldGemini 3.5 FlashGemini 2.5 Flash
PublisherGoogle Deep MindGoogle Deep Mind
Release date2026-06-202025-04-17
Model typeMultimodal modelReasoning model
ArchitectureDenseDense
ParametersNot availableNot available
Context length1M1000K
Max output64K64K

API pricing

Prices use DataLearner records when available; missing fields are not inferred.

ItemGemini 3.5 FlashGemini 2.5 Flash
Text input$1.5 / 1M tokensNot public
Text output$9 / 1M tokensNot public

One or both models have incomplete public pricing.

Summary

  • Gemini 3.5 Flashleads in:General Knowledge (2/2), Math and Reasoning (1/1)

On average across the 3 shared benchmarks, Gemini 3.5 Flash scores 30.66 higher.

Largest single-benchmark gap: Simple Bench — Gemini 3.5 Flash 76.70 vs Gemini 2.5 Flash 41.20 (+35.50).

Page generated from structured model, pricing and benchmark records. No real-time LLM is used to write the prose.