GLM-4.6 Benchmark Details

GLM-4.6 currently shows benchmark results led by AIME2025 (15 / 106, score 98.60), LiveCodeBench (18 / 120, score 84.50), MMLU Pro (43 / 126, score 83).

Benchmark Results

GLM-4.6

Benchmark Results

Thinking

General Knowledge

9 evaluations
Benchmark / mode
Score
Rank/total
83
43 / 126
78
69 / 126
82.90
62 / 179
81
70 / 179
63
136 / 179
LiveBench
Standard Mode
55.19
81 / 115
30.40
76 / 159
17.20
118 / 159
5.20
152 / 159

Coding and Software Engineer

5 evaluations
Benchmark / mode
Score
Rank/total
84.50
18 / 120
82.80
24 / 120
56
79 / 120

Math and Reasoning

4 evaluations
Benchmark / mode
Score
Rank/total
98.60
15 / 106
98.60
15 / 106
44
92 / 106
2.10
56 / 80

AI Agent - Tool Usage

1 evaluations
Benchmark / mode
Score
Rank/total
40.50
12 / 35

Agent Level Benchmark

2 evaluations
Benchmark / mode
Score
Rank/total
75.90
20 / 40

Instruction Following

1 evaluations
Benchmark / mode
Score
Rank/total
43
29 / 29

AI Agent - Information Search

1 evaluations
Benchmark / mode
Score
Rank/total
45.10
38 / 45