See key specs and per-benchmark scores for each model/mode. Scroll horizontally for all columns. 当前对比 2 个模型的评测数据与核心参数。

Gemma 4 31B
DeepMind
Best overall
Gemma 4 31B · 83.17
Best single
Gemma 4 31B · MMLU Pro 85.20
Modality coverage
Gemma 4 31B · 3 modalities
Head to head
3
Benchmarks
3
Wins
0
Losses
+36.63
Average diff
Compare benchmark results across thinking modes and tool usage.
Data sourced primarily from official releases (GitHub, Hugging Face, papers), then benchmark leaderboards, then third-party evaluators. Learn about our data methodology
Complete scores for each model/mode across selected benchmarks.
3 benchmarks with comparable scores. Each model shows its best score; mode label is displayed below.
| Benchmark | Gemma 4 31B | Gemma 3 - 27B (IT) |
|---|---|---|
GPQA Diamond 综合评估 | 84.30Thinking Enabled | 42.40Standard Mode |
MMLU Pro 综合评估 | 85.20Thinking Enabled | 67.50Standard Mode |
LiveCodeBench 编程与软件工程 | 80.00Thinking Enabled | 29.70Standard Mode |
Side-by-side input/output token pricing
Licensing, MoE architecture, and multi-modality support.
| Features & specs | Gemma 4 31BDeepMind | Gemma 3 - 27B (IT)Google Deep Mind |
|---|---|---|
Core specsRelease | 2026-04-02 | 2025-03-12 |
Context length | 256K | 128K |
Parameters | 31 | 270 |
Active parameters | 31 | Not provided |
Max output | 32768 | Not provided |
MoE | No | No |
LicenseCode Open Source | Not provided | Not provided |
Weights Open Source | Not provided | Not provided |
Commercial use | 免费商用授权 | 免费商用授权 |
Modality supportText Input/Output | / | Not provided |
Image Input/Output | / | Not provided |
Video Input/Output | / | Not provided |
ResourcesPaper / report | Gemma 4: Byte for byte, the most capable open models | Gemma 3 Technical Report |
DataLearner blog | Google Gemma 4 正式开源:Apache 2.0 协议、手机端可运行、原生支持多模态和 Agent 工作流 | Google开源第三代Gemma-3系列模型:支持多模态、最多128K输入,其中Gemma 3-27B在大模型匿名竞技场得分超过了Qwen2.5-Max |