GLM-5.2vsGLM-5

Across 4 shared benchmarks, GLM-5.2 leads overall: GLM-5.2 wins 4, GLM-5 wins 0, with 0 ties and an average score difference of +6.12.

智谱AI · 2026-06-13 · Reasoning model

智谱AI · 2026-02-11 · Chat model

GLM-5.24 wins(100%)(0%)0 winsGLM-5

Benchmark scores

Grouped by capability, sorted by largest gap within each. 4 shared benchmarks.

GLM-5.2 2/2

Benchmark	GLM-5.2	GLM-5	Diff
GPQA Diamond	91.2015 / 179Thinking (No Tools)	8644 / 179Thinking (No Tools)	+5.20
HLE	54.708 / 159Thinking (With Tools)	50.4019 / 159	+4.30

GLM-5.2 2/2

Benchmark	GLM-5.2	GLM-5	Diff
IMO-AnswerBench	911 / 20Thinking (No Tools)	82.5014 / 20Thinking (No Tools)	+8.50
AIME 2026	99.201 / 15Thinking (No Tools)	92.708 / 15Thinking (No Tools)	+6.50

Prices use DataLearner records when available; missing fields are not inferred.

On average across the 4 shared benchmarks, GLM-5.2 scores 6.12 higher.

Largest single-benchmark gap: IMO-AnswerBench — GLM-5.2 91 vs GLM-5 82.50 (+8.50).

Page generated from structured model, pricing and benchmark records. No real-time LLM is used to write the prose.