GLM-5 vs GLM-4.5 Benchmark Comparison

Key specs and per-benchmark scores for each model and mode. This page compares benchmark data and core parameters for 2 models.

GLM-5

Zhipu AI

Release: 2026-02-11
Context length: 200K
Parameters: 7,440 (400 active), in units of 100 million, i.e. 744B total with 40B active
Max output: 131,072 tokens


Best overall

GLM-5 · 71.40

Best single

GLM-5 · GPQA Diamond 86.00

Modality coverage

GLM-5 · 1 modality

Head to head

GLM-5 vs GLM-4.5

Average diff: +18.83 in favor of GLM-5

Performance benchmarks

Compare benchmark results across thinking modes and tool usage.

Data sourced primarily from official releases (GitHub, Hugging Face, papers), then benchmark leaderboards, then third-party evaluators.

Filter: Best Available · 2 modes · 3 benchmarks


Benchmark score table

Complete scores for each model/mode across selected benchmarks.

3 benchmarks with comparable scores. Each model shows its best score, and the mode label is shown with each score.

Benchmark	Category	GLM-5	GLM-4.5
GPQA Diamond	General evaluation	86.00 (Thinking Enabled)	79.10 (Thinking Enabled)
HLE	General evaluation	50.40 (Thinking Enabled, Tools)	14.40 (Thinking Enabled)
SWE-bench Verified	Coding and software engineering	77.80 (Thinking Enabled)	64.20 (Thinking Enabled)
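
The page does not state how its summary figures are derived. As a rough sketch, assuming "Best overall" is an unweighted mean of a model's scores and "Average diff" is the unweighted mean of the per-benchmark differences, the numbers in the table above can be recomputed as follows. The names `SCORES` and `summarize` are hypothetical and chosen only for this illustration.

```python
# Best-available scores copied from the benchmark score table:
# benchmark -> (GLM-5, GLM-4.5)
SCORES = {
    "GPQA Diamond": (86.00, 79.10),
    "HLE": (50.40, 14.40),
    "SWE-bench Verified": (77.80, 64.20),
}


def summarize(scores):
    """Aggregate a two-model score table.

    Assumption: every benchmark carries equal weight. The page does not
    document its actual aggregation method.
    """
    pairs = list(scores.values())
    n = len(pairs)
    diffs = [first - second for first, second in pairs]
    return {
        "benchmarks": n,
        "wins": sum(1 for d in diffs if d > 0),    # first model ahead
        "ties": sum(1 for d in diffs if d == 0),
        "losses": sum(1 for d in diffs if d < 0),  # first model behind
        "mean_first": round(sum(first for first, _ in pairs) / n, 2),
        "mean_second": round(sum(second for _, second in pairs) / n, 2),
        "average_diff": round(sum(diffs) / n, 2),
    }


summary = summarize(SCORES)
print(summary)
```

Under these assumptions the result matches the page's figures: a mean of 71.40 for GLM-5 and an average difference of +18.83. Note that the HLE row compares a GLM-5 score obtained with tools against a GLM-4.5 score without tools, so that row contributes the largest share of the average difference and is not a like-for-like comparison.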

API price comparison

Side-by-side input/output token pricing

Detailed feature breakdown

Licensing, MoE architecture, and multi-modality support.

Features & specs	GLM-5 (Zhipu AI)	GLM-4.5 (Zhipu AI)
Core specs
Release	2026-02-11	2025-07-28
Context length	200K	128K
Parameters (units of 100 million)	7,440	3,550
Active parameters (units of 100 million)	400	320
Max output (tokens)	131,072	97,280
MoE	Yes	Yes
Supported modes	No mode data	Non-Thinking Mode, Thinking Mode
License
Code Open Source	Not provided	Not provided
Weights Open Source	Closed Source	Not provided
Commercial use	Free for commercial use	Free for commercial use
Modality support
Text Input/Output	/	/
Resources
Paper / report	GLM-5: From Vibe Coding to Agentic Engineering	GLM-4.5: Reasoning, Coding, and Agentic Abilities
DataLearner blog	Not provided	Zhipu AI Releases the GLM-4.5 Series: In-Depth Technical Analysis and Multi-Dimensional Performance Evaluation