Gemini 3.1 Pro PreviewvsGemini 3.0 Pro (Preview 11-2025)

在 15 个共同 benchmark 中，Gemini 3.1 Pro Preview 整体领先：Gemini 3.1 Pro Preview 领先 12 项，Gemini 3.0 Pro (Preview 11-2025) 领先 3 项，持平 0 项，平均分差 +7.84。

Google Deep Mind · 2026-02-20 · 多模态大模型

Google Deep Mind · 2025-11-18 · 多模态大模型

Gemini 3.1 Pro Preview12 项(80%)(20%)3 项Gemini 3.0 Pro (Preview 11-2025)

评测分数

按能力类目分组，每组内按分差大小排列；共 15 项。

Gemini 3.1 Pro Preview 领先 4/4

评测项	Gemini 3.1 Pro Preview	Gemini 3.0 Pro (Preview 11-2025)	分差
ARC-AGI-2	77.109 / 62Thinking High (No Tools)	45.1026 / 62	+32
LiveBench	79.933 / 115Thinking High (No Tools)	73.3924 / 115Thinking High (No Tools)	+6.54
HLE	51.4022 / 172Thinking High (With Tools)	45.8040 / 172	+5.60
GPQA Diamond	94.303 / 187Thinking High (No Tools)	93.805 / 187	+0.50

Gemini 3.1 Pro Preview 领先 2/2

评测项	Gemini 3.1 Pro Preview	Gemini 3.0 Pro (Preview 11-2025)	分差
τ²-Bench	90.802 / 43Thinking High (With Tools)	85.408 / 43	+5.40
τ²-Bench - Telecom	99.301 / 35Thinking High (With Tools)	985 / 35	+1.30

Gemini 3.1 Pro Preview 领先 2/2

评测项	Gemini 3.1 Pro Preview	Gemini 3.0 Pro (Preview 11-2025)	分差
Terminal Bench 2.0	68.508 / 47Thinking High (With Tools)	56.9025 / 47	+11.60
MCP-Atlas	78.209 / 27Thinking High (With Tools)	70.3015 / 27Normal (With Tools)	+7.90

胶着 2/2

评测项	Gemini 3.1 Pro Preview	Gemini 3.0 Pro (Preview 11-2025)	分差
SWE-bench Verified	80.6011 / 112Thinking High (With Tools)	76.2036 / 112	+4.40
LiveCodeBench	91.703 / 123Thinking High (With Tools)	922 / 123	-0.30

Gemini 3.0 Pro (Preview 11-2025) 领先 2/2

评测项	Gemini 3.1 Pro Preview	Gemini 3.0 Pro (Preview 11-2025)	分差
FrontierMath - Tier 4	16.7020 / 80Normal (No Tools)	18.8016 / 80	-2.10
FrontierMath	36.9011 / 60Thinking High (No Tools)	3810 / 60	-1.10

Gemini 3.1 Pro Preview 领先 1/1

评测项	Gemini 3.1 Pro Preview	Gemini 3.0 Pro (Preview 11-2025)	分差
BrowseComp	85.905 / 53Thinking High (With Tools + Internet)	59.2038 / 53	+26.70

Gemini 3.1 Pro Preview 领先 1/1

评测项	Gemini 3.1 Pro Preview	Gemini 3.0 Pro (Preview 11-2025)	分差
Pinch Bench	86.7010 / 37Thinking (With Tools)	70.7031 / 37Thinking (With Tools)	+16

Gemini 3.1 Pro Preview 领先 1/1

评测项	Gemini 3.1 Pro Preview	Gemini 3.0 Pro (Preview 11-2025)	分差
Simple Bench	79.602 / 63Normal (No Tools)	76.405 / 63Thinking (No Tools)	+3.20

价格优先使用 DataLearner 配置的 API 记录；缺失项不做推测。

价格项	Gemini 3.1 Pro Preview	Gemini 3.0 Pro (Preview 11-2025)
文本输入	$2 / 1M tokens	$2 / 1M tokens
文本输出	$12 / 1M tokens	$12 / 1M tokens

Gemini 3.1 Pro Preview在以下类目领先:General Knowledge (4/4)、Agent Level Benchmark (2/2)、AI Agent - Tool Usage (2/2)、AI Agent - Information Search (1/1)、Claw-style Agent Evaluation (1/1)、常识推理 (1/1)
Gemini 3.0 Pro (Preview 11-2025)在以下类目领先:Math and Reasoning (2/2)
胶着类目:Coding and Software Engineer

15 个共同 benchmark 上，Gemini 3.1 Pro Preview 平均高出 7.84 分。

单项差距最大的 benchmark：ARC-AGI-2 — Gemini 3.1 Pro Preview 77.10，Gemini 3.0 Pro (Preview 11-2025) 45.10（分差 +32）。

本页正文由结构化模型、价格与 benchmark 数据生成，不使用实时 LLM 撰写。