Grok 2 Benchmark Details

Grok 2 currently shows benchmark results led by MMLU (22 / 65, score 87.50), MATH (15 / 42, score 76.10), HumanEval (14 / 39, score 88.40).

Benchmark Results

Grok 2

Benchmark Results

Thinking

General Knowledge

3 evaluations
Benchmark / mode
Score
Rank/total
87.50
22 / 65
75.50
81 / 126
56
147 / 179

Coding and Software Engineer

1 evaluations
Benchmark / mode
Score
Rank/total
88.40
14 / 39

Math and Reasoning

2 evaluations
Benchmark / mode
Score
Rank/total
76.10
15 / 42
0.70
55 / 60

常识推理

1 evaluations
Benchmark / mode
Score
Rank/total
Simple Bench
Standard Mode
22.70
56 / 63