Gemini 3.5 FlashvsGemini 3.0 Flash
Across 6 shared benchmarks, Gemini 3.5 Flash leads overall: Gemini 3.5 Flash wins 5, Gemini 3.0 Flash wins 1, with 0 ties and an average score difference of +16.53.
Google Deep Mind · 2026-06-20 · Multimodal model
Google Deep Mind · 2025-12-17 · Chat model
Benchmark scores
Grouped by capability, sorted by largest gap within each. 6 shared benchmarks.
General Knowledge
Gemini 3.5 Flash 2/3| Benchmark | Gemini 3.5 Flash | Gemini 3.0 Flash | Diff |
|---|---|---|---|
| ARC-AGI-2 | 72.1011 / 59Thinking High (With Tools) | 33.6027 / 59 | +38.50 |
| LiveBench | 75.0217 / 115Thinking High (No Tools) | 56.3579 / 115Normal (No Tools) | +18.67 |
| HLE | 40.2055 / 161Thinking High (With Tools) | 43.5040 / 161 | -3.30 |
AI Agent - Tool Usage
Gemini 3.5 Flash 2/2| Benchmark | Gemini 3.5 Flash | Gemini 3.0 Flash | Diff |
|---|---|---|---|
| MCP-Atlas | 83.601 / 23Thinking High (With Tools) | 6216 / 23Normal (With Tools) | +21.60 |
| TerminalBench 2.1 | 76.208 / 16Thinking High (With Tools) | 5815 / 16Thinking High (With Tools) | +18.20 |
Coding and Software Engineer
Gemini 3.5 Flash 1/1| Benchmark | Gemini 3.5 Flash | Gemini 3.0 Flash | Diff |
|---|---|---|---|
| SWE-Bench Pro - Public | 55.1021 / 44Thinking High (With Tools) | 49.6033 / 44Thinking High (With Tools) | +5.50 |
Specs
| Field | Gemini 3.5 Flash | Gemini 3.0 Flash |
|---|---|---|
| Publisher | Google Deep Mind | Google Deep Mind |
| Release date | 2026-06-20 | 2025-12-17 |
| Model type | Multimodal model | Chat model |
| Architecture | Dense | Dense |
| Parameters | Not available | Not available |
| Context length | 1M | 2000K |
| Max output | 64K | 64K |
API pricing
Prices use DataLearner records when available; missing fields are not inferred.
| Item | Gemini 3.5 Flash | Gemini 3.0 Flash |
|---|---|---|
| Text input | $1.5 / 1M tokens | Not public |
| Text output | $9 / 1M tokens | Not public |
One or both models have incomplete public pricing.
Summary
- Gemini 3.5 Flashleads in:General Knowledge (2/3), AI Agent - Tool Usage (2/2), Coding and Software Engineer (1/1)
On average across the 6 shared benchmarks, Gemini 3.5 Flash scores 16.53 higher.
Largest single-benchmark gap: ARC-AGI-2 — Gemini 3.5 Flash 72.10 vs Gemini 3.0 Flash 33.60 (+38.50).
Page generated from structured model, pricing and benchmark records. No real-time LLM is used to write the prose.