Gemini-2.5-Pro-Preview-05-06 Benchmark Details

Gemini-2.5-Pro-Preview-05-06 currently shows benchmark results led by MATH-500 (1 / 44, score 98.80), AIME 2024 (9 / 62, score 92), Aider-Polyglot (9 / 59, score 76.90).

Benchmark Results

Gemini-2.5-Pro-Preview-05-06

Benchmark Results

Thinking

General Knowledge

2 evaluations
Benchmark / mode
Score
Rank/total
83
61 / 179
21.60
99 / 159

Common Sense

1 evaluations
Benchmark / mode
Score
Rank/total
54
10 / 45

Coding and Software Engineer

2 evaluations
Benchmark / mode
Score
Rank/total
77.10
34 / 120
63.20
73 / 108

Math and Reasoning

5 evaluations
Benchmark / mode
Score
Rank/total
98.80
1 / 44
92
9 / 62
83
51 / 106
10.30
25 / 60

Multimodal Understanding

1 evaluations
Benchmark / mode
Score
Rank/total
79.60
13 / 28

Agent Level Benchmark

1 evaluations
Benchmark / mode
Score
Rank/total
Aider-Polyglot
Standard Mode
76.90
9 / 59