GLM-5 vs GLM-4.5

Across 3 shared benchmarks, GLM-5 leads overall: GLM-5 wins 3, GLM-4.5 wins 0, with 0 ties and an average score difference of +18.83.

GLM-5: Zhipu AI (智谱AI) · 2026-02-11 · AI model

GLM-4.5: Zhipu AI (智谱AI) · 2025-07-28 · Reasoning model

GLM-5: 3 wins (100%) · GLM-4.5: 0 wins (0%)

Benchmark scores

Grouped by capability, sorted by largest gap within each. 3 shared benchmarks.

GLM-5 wins 2 of 2

Benchmark	GLM-5 (score, position, mode)	GLM-4.5 (score, position, mode)	Diff
HLE	50.40, 15 / 149, thinking + tool use	14.40, 113 / 149, thinking	+36
GPQA Diamond	86, 40 / 175, thinking (no tools)	79.10, 77 / 175, thinking	+6.90

GLM-5 wins 1 of 1

Benchmark

Prices use DataLearner records when available; missing fields are not inferred.

On average across the 3 shared benchmarks, GLM-5 scores 18.83 higher.

Largest single-benchmark gap: HLE — GLM-5 50.40 vs GLM-4.5 14.40 (+36).
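The win counts, average difference, and largest gap reported on this page are simple arithmetic over per-benchmark score pairs. The following is a minimal Python sketch of that calculation, not the page's actual generator. The names `SCORES`, `summarize`, and `implied_missing_diff` are hypothetical. Only the two benchmarks listed in the table are included, so the mean computed here (21.45) differs from the page's three-benchmark figure of 18.83. The third benchmark's difference is not shown on the page. The helper only infers what it would have to be, roughly +13.6, if the reported average is accurate to two decimals.

```python
from statistics import mean

# Scores copied from the two rows visible in the table: (GLM-5, GLM-4.5).
# The third shared benchmark is not listed on the page, so it is omitted.
SCORES = {
    "HLE": (50.40, 14.40),
    "GPQA Diamond": (86.00, 79.10),
}


def summarize(scores):
    """Return wins, ties, mean difference and largest gap for model A vs model B."""
    diffs = {name: round(a - b, 2) for name, (a, b) in scores.items()}
    largest = max(diffs, key=lambda name: abs(diffs[name]))
    return {
        "diffs": diffs,
        "wins_a": sum(d > 0 for d in diffs.values()),
        "wins_b": sum(d < 0 for d in diffs.values()),
        "ties": sum(d == 0 for d in diffs.values()),
        "mean_diff": round(mean(diffs.values()), 2),
        "largest_gap": (largest, diffs[largest]),
    }


def implied_missing_diff(reported_mean, total_count, known_diffs):
    """Infer the single unlisted difference from a reported mean (approximate,
    because the reported mean is itself rounded)."""
    return round(reported_mean * total_count - sum(known_diffs), 2)


if __name__ == "__main__":
    result = summarize(SCORES)
    print(result)
    print(implied_missing_diff(18.83, 3, result["diffs"].values()))
```

Rounding each difference to two decimals before aggregating keeps the output in the same form as the table's Diff column and avoids floating-point noise such as 6.900000000000006.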

Page generated from structured model, pricing and benchmark records. No real-time LLM is used to write the prose.