Gemini-2.5-Pro-Preview-05-06 Benchmark Details

Gemini-2.5-Pro-Preview-05-06's strongest benchmark placements are currently MATH-500 (rank 1 / 44, score 98.80), AIME 2024 (rank 9 / 62, score 92), and Aider-Polyglot (rank 9 / 59, score 76.90).

Benchmark Results

General Knowledge

2 evaluations

Benchmark / mode                Score           Rank/total
GPQA Diamond                    (not listed)    61 / 179
HLE                             21.60           99 / 159

Common Sense

1 evaluation

Benchmark / mode                Score           Rank/total
SimpleQA                        (not listed)    10 / 45

Coding and Software Engineering

2 evaluations

Benchmark / mode                Score           Rank/total
LiveCodeBench                   77.10           34 / 120
SWE-bench Verified              63.20           73 / 108

Math and Reasoning

5 evaluations

Benchmark / mode                Score           Rank/total
MATH-500                        98.80           1 / 44
AIME 2024                       92              9 / 62
AIME 2025                       (not listed)    51 / 106
FrontierMath                    10.30           25 / 60
FrontierMath - Tier 4           2.10            56 / 80

Multimodal Understanding

1 evaluation

Benchmark / mode                Score           Rank/total
MMMU                            79.60           13 / 28

Agent Level Benchmark

1 evaluation

Benchmark / mode                Score           Rank/total
Aider-Polyglot (Standard Mode)  76.90           9 / 59
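
Because each leaderboard has a different number of entries, raw ranks are hard to compare across benchmarks. One possible way to put them on a common scale is relative rank, meaning rank divided by total, where lower is better. The sketch below applies that to the rank/total figures listed above. The function names are hypothetical, and treating relative rank as the ordering behind the "led by" summary is an assumption rather than the page's documented method, although it does reproduce the same three benchmarks.

```python
# Rank/total pairs copied from the benchmark tables on this page.
RESULTS = {
    "GPQA Diamond": (61, 179),
    "HLE": (99, 159),
    "SimpleQA": (10, 45),
    "LiveCodeBench": (34, 120),
    "SWE-bench Verified": (73, 108),
    "MATH-500": (1, 44),
    "AIME 2024": (9, 62),
    "AIME 2025": (51, 106),
    "FrontierMath": (25, 60),
    "FrontierMath - Tier 4": (56, 80),
    "MMMU": (13, 28),
    "Aider-Polyglot": (9, 59),
}


def relative_rank(rank, total):
    """Return rank as a fraction of the leaderboard size (lower is better)."""
    return rank / total


def top_benchmarks(results, n=3):
    """Return the n benchmark names with the best (lowest) relative rank."""
    ordered = sorted(results, key=lambda name: relative_rank(*results[name]))
    return ordered[:n]


if __name__ == "__main__":
    for name in top_benchmarks(RESULTS):
        rank, total = RESULTS[name]
        print(f"{name}: {rank} / {total} = {relative_rank(rank, total):.3f}")
```

Relative rank ignores the scores themselves and changes whenever models are added to a leaderboard, so it is only a rough guide to where the model stands.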
