GLM-5 vs GLM-4.7
Across 9 shared benchmarks, GLM-5 leads overall: GLM-5 wins 7, GLM-4.7 wins 1, and 1 is a tie, with an average score difference of +7.52 in GLM-5's favor.
GLM-5: 7 wins (78%) · Ties: 1 (11%) · GLM-4.7: 1 win
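The percentages in the tally above follow directly from the win and tie counts. A minimal sketch of that arithmetic, assuming whole-number rounding (the helper name `share` is hypothetical and not part of the page):

```python
def share(count: int, total: int) -> int:
    """Return count as a whole-number percentage of total (assumed rounding)."""
    return round(100 * count / total)


# Counts taken from the head-to-head tally on this page.
glm5_wins, glm47_wins, ties = 7, 1, 1
total = glm5_wins + glm47_wins + ties

print(f"shared benchmarks: {total}")
print(f"GLM-5 wins: {share(glm5_wins, total)}%")
print(f"ties: {share(ties, total)}%")
```

With 9 shared benchmarks, 7 wins come to 78% and a single tie to 11%, matching the figures shown.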
Benchmark scores
Grouped by capability, sorted by largest gap within each. 9 shared benchmarks.
Agent Level Benchmark
GLM-5 leads 2/2
| Benchmark | GLM-5 | GLM-4.7 | Diff |
|---|---|---|---|
| Terminal Bench Hard | 43 (rank 2 / 13, thinking + tool use) | 33.30 (rank 7 / 13, thinking + tool use) | +9.70 |
| τ²-Bench | 89.70 (rank 4 / 40, thinking + tool use) | 87.40 (rank 6 / 40, thinking + tool use) | +2.30 |
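The Diff column appears to be the GLM-5 score minus the GLM-4.7 score, formatted with an explicit sign and two decimals. The sketch below checks that reading against the two rows above, taking the scores as 43 and 33.30 for Terminal Bench Hard and 89.70 and 87.40 for τ²-Bench. The function name `signed_diff` is hypothetical.

```python
def signed_diff(glm5: float, glm47: float) -> str:
    """Format GLM-5 minus GLM-4.7 with an explicit sign and two decimals."""
    return f"{glm5 - glm47:+.2f}"


# Scores as read from the Agent Level Benchmark table.
scores = {
    "Terminal Bench Hard": (43.0, 33.30),
    "τ²-Bench": (89.70, 87.40),
}

for name, (glm5, glm47) in scores.items():
    print(f"{name}: {signed_diff(glm5, glm47)}")
```

Both rows reproduce the published Diff values (+9.70 and +2.30), which supports reading the leading number in each cell as the score.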
General Knowledge
GLM-5 leads 2/2
| Benchmark |
|---|
Specs
| Field | GLM-5 | GLM-4.7 |
|---|---|---|
| Publisher | Zhipu AI (智谱AI) | Zhipu AI (智谱AI) |
| Release date | 2026-02-11 | 2025-12-22 |
| Model type | AI model | AI model |
| Architecture | MoE | MoE |
| Parameters | 744B | 358B |
| Context length | 200K | 200K |
| Max output | 131072 | 132072 |
API pricing
Prices use DataLearner records when available; missing fields are not inferred.
| Item | GLM-5 | GLM-4.7 |
|---|---|---|
| Text input | $1 / 1M tokens | $0.6 / 1M tokens |
| Text output | $3.2 / 1M tokens | $2.2 / 1M tokens |
| Cache read | Not public | $0.11 / 1M tokens |
| Cache write | $0.2 / 1M tokens | Not public |
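To see what the per-million-token prices mean for a concrete workload, the sketch below estimates cost from the text input and output rates in the table. The workload size (2M input tokens, 500K output tokens) and the function name `estimate_cost` are illustrative assumptions. Cache pricing is left out because each model has one cache field marked as not public.

```python
def estimate_cost(input_tokens: int, output_tokens: int,
                  input_price: float, output_price: float) -> float:
    """Return the USD cost given prices quoted per 1M tokens."""
    per_million = 1_000_000
    return (input_tokens / per_million) * input_price \
        + (output_tokens / per_million) * output_price


# Text input/output prices (USD per 1M tokens) from the pricing table.
PRICES = {
    "GLM-5": (1.0, 3.2),
    "GLM-4.7": (0.6, 2.2),
}

# Hypothetical workload, chosen only for illustration.
workload = {"input_tokens": 2_000_000, "output_tokens": 500_000}

costs = {
    model: estimate_cost(workload["input_tokens"], workload["output_tokens"],
                         input_price, output_price)
    for model, (input_price, output_price) in PRICES.items()
}

for model, cost in costs.items():
    print(f"{model}: ${cost:.2f}")
```

For this assumed workload, GLM-5 comes to about $3.60 and GLM-4.7 to about $2.30. Actual bills depend on cache usage and any pricing tiers not captured in the table.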
Summary
- GLM-5 leads in: Agent Level Benchmark (2/2), General Knowledge (2/2), AI Agent - Information Search (1/1), AI Agent - Tool Usage (1/1), Coding and Software Engineer (1/1)
- GLM-4.7 leads in: Math and Reasoning (1/2)
On average across the 9 shared benchmarks, GLM-5 scores 7.52 higher.
Largest single-benchmark gap: BrowseComp — GLM-5 75.90 vs GLM-4.7 52 (+23.90).
Page generated from structured model, pricing and benchmark records. No real-time LLM is used to write the prose.