GLM-4.5-Air Benchmark Details

GLM-4.5-Air's strongest benchmark results are MATH-500 (score 98.10, rank 5 / 44), AIME 2024 (score 89.40, rank 15 / 62), and Pinch Bench (score 85.70, rank 13 / 37).

Benchmark Results

General Knowledge

3 evaluations

| Benchmark / mode | Score | Rank / total |
| --- | --- | --- |
| MMLU Pro | 81.40 | 54 / 132 |
| GPQA Diamond | (not listed) | 100 / 187 |
| HLE | 10.60 | 143 / 170 |

Coding and Software Engineering

2 evaluations

| Benchmark / mode | Score | Rank / total |
| --- | --- | --- |
| LiveCodeBench | 70.70 | 50 / 123 |
| SWE-bench Verified | 57.60 | 83 / 111 |

Math and Reasoning

2 evaluations

| Benchmark / mode | Score | Rank / total |
| --- | --- | --- |
| MATH-500 | 98.10 | 5 / 44 |
| AIME 2024 | 89.40 | 15 / 62 |

AI Agent - Tool Usage

1 evaluation

| Benchmark / mode | Score | Rank / total |
| --- | --- | --- |
| Terminal-Bench | (not listed) | 22 / 35 |

Claw-style Agent Evaluation

1 evaluation

| Benchmark / mode | Score | Rank / total |
| --- | --- | --- |
| Pinch Bench (Thinking Enabled, Tools) | 85.70 | 13 / 37 |
