- Gemini 3.1 Pro Previewleads in:General Knowledge (3/3), Agent Level Benchmark (2/2), AI Agent - Information Search (1/1), AI Agent - Tool Usage (1/1), Claw-style Agent Evaluation (1/1)
- Gemini 3.0 Pro (Preview 11-2025)leads in:Math and Reasoning (2/2)
- Tied in:Coding and Software Engineer
On average across the 12 shared benchmarks, Gemini 3.1 Pro Preview scores 8.33 higher.
Largest single-benchmark gap: ARC-AGI-2 — Gemini 3.1 Pro Preview 77.10 vs Gemini 3.0 Pro (Preview 11-2025) 45.10 (+32).
Page generated from structured model, pricing and benchmark records. No real-time LLM is used to write the prose.