GLM-4.5

Name: GLM-4.5-MoE-355B-A32B-0715
Author: Zhipu AI (智谱AI)

Reasoning model

GLM-4.5-MoE-355B-A32B-0715

Release date: 2025-07-28
Updated: 2025-07-29 11:11:41

Parameters

355B

Context length

128K

Chinese support

Supported

Reasoning ability

GLM-4.5-MoE-355B-A32B-0715 is a reasoning model published by Zhipu AI (智谱AI) and released on 2025-07-28. It has 355B total parameters (32B active per token) and a 128K-token context length, requires about 710 GB of storage, and is distributed under the Apache 2.0 license.

Data sourced primarily from official releases (GitHub, Hugging Face, papers), then benchmark leaderboards, then third-party evaluators.

Model basics

Reasoning traces

Supported

Thinking modes

Thinking modes not supported

Context length

128K tokens

Max output length

97280 tokens

Model type

Reasoning model

Release date

2025-07-28

Model file size

710 GB

MoE architecture

Yes

Total params / Active params

355B / 32B

Knowledge cutoff

No data
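
The 710 GB file size listed above is consistent with storing all 355B parameters at 2 bytes each (for example BF16). That precision is an assumption, since this page does not state the weight format. A minimal sketch of the arithmetic, using decimal gigabytes and an illustrative helper name:

```python
def weight_storage_gb(total_params: float, bytes_per_param: float = 2.0) -> float:
    """Approximate checkpoint size in decimal GB (1 GB = 1e9 bytes).

    bytes_per_param=2.0 assumes 16-bit weights; this is an assumption,
    not something the page states.
    """
    return total_params * bytes_per_param / 1e9


TOTAL_PARAMS = 355e9   # total parameters, from the page
ACTIVE_PARAMS = 32e9   # parameters active per token, from the page

full_gb = weight_storage_gb(TOTAL_PARAMS)
active_fraction = ACTIVE_PARAMS / TOTAL_PARAMS

print(f"Estimated weight storage: {full_gb:.0f} GB")
print(f"Share of parameters active per token: {active_fraction:.1%}")
```

In a mixture-of-experts model, every expert still has to be stored, so the storage estimate uses the total parameter count rather than the 32B active count.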

Open source & experience

Code license

Apache 2.0

Weights license

Apache 2.0 (free for commercial use)

GitHub repo

https://github.com/THUDM/GLM-4

Hugging Face

https://huggingface.co/zai-org/GLM-4.5

Live demo

https://chat.z.ai/

Official resources

Paper

GLM-4.5: Reasoning, Coding, and Agentic Abilities

DataLearnerAI blog

API details

API speed

3/5

💡Default unit: $/1M tokens. If vendors use other units, follow their published pricing.

Standard pricing

Modality	Input ($/1M tokens)	Output ($/1M tokens)
Text	$0.60	$2.20
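
To turn the per-million-token rates above into a per-request figure, multiply each token count by its rate and divide by one million. The sketch below assumes the standard text rates listed in the table; the function name and the example token counts are illustrative only, and vendors may bill differently.

```python
INPUT_USD_PER_M = 0.6    # $ per 1M input tokens, from the pricing table
OUTPUT_USD_PER_M = 2.2   # $ per 1M output tokens, from the pricing table


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request at the standard text rates."""
    total = input_tokens * INPUT_USD_PER_M + output_tokens * OUTPUT_USD_PER_M
    return total / 1_000_000


# Hypothetical request: 10,000 prompt tokens and 2,000 generated tokens.
cost = estimate_cost_usd(10_000, 2_000)
print(f"Estimated cost: ${cost:.4f}")
```

For this hypothetical request the estimate comes to about $0.0104, with the output side contributing $0.0044 of that despite being one fifth of the input length.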

Benchmark Results

GLM-4.5's strongest benchmark results are MATH-500 (rank 3 of 44, score 98.20), AIME 2024 (rank 14 of 62, score 91), and MMLU Pro (rank 30 of 124, score 84.60). This page also consolidates core specs, context limits, and API pricing, so the model can be evaluated on benchmark results and deployment constraints together.
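
Because each leaderboard covers a different number of models, raw ranks are easier to compare as a fraction of the field. The sketch below assumes that a pair such as "3 / 44" means rank 3 among 44 ranked models, which this page does not state explicitly; the variable and function names are illustrative.

```python
# (rank, models ranked, score) as listed on this page
results = {
    "MATH-500": (3, 44, 98.20),
    "AIME 2024": (14, 62, 91.0),
    "MMLU Pro": (30, 124, 84.60),
}


def top_fraction(rank: int, total: int) -> float:
    """Fraction of the leaderboard at or above this rank (lower is better)."""
    return rank / total


for name, (rank, total, score) in results.items():
    print(f"{name}: rank {rank} of {total} "
          f"(top {top_fraction(rank, total):.1%}), score {score}")
```

Under that reading, MATH-500 places the model in roughly the top 7% of its leaderboard, while AIME 2024 and MMLU Pro place it in roughly the top quarter.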