GLM-OCR

Name: GLM-OCR
Author: 智谱AI

Vision modelOCR modelGLM-OCRGLM-OCR

GLM-OCR

Release date: 2026-02-03Updated: 2026-06-14 23:13:18.967926

Live demoGitHub Hugging Face Compare

Parameters

900M

Context length

Chinese support

Supported

Reasoning ability

GLM-OCR is an AI model published by 智谱AI, released on 2026-02-03, for Vision model, with 900M parameters, and 8K context length, requiring about 1.8GB storage, under the Apache 2.0 license.

Data sourced primarily from official releases (GitHub, Hugging Face, papers), then benchmark leaderboards, then third-party evaluators. Learn about our data methodology

GLM-OCR

Model basics

Reasoning traces

Not supported

Thinking modes

Thinking modes not supported

Context length

8K tokens

Max output length

4K tokens

Model type

Vision model

Modality (in / out)

Text, Image → Text

Release date

2026-02-03

Model file size

1.8GB

MoE architecture

Total params / Active params

900M / N/A

Knowledge cutoff

No data

GLM-OCR

Open source & experience

Code license

Apache 2.0

Weights license

Apache 2.0- 免费商用授权

GitHub repo

https://github.com/THUDM/GLM-OCR

Hugging Face

https://huggingface.co/THUDM/glm-ocr

Live demo

No live demo

GLM-OCR

Official resources

Paper

GLM-OCR: A Lightweight and Effective OCR Model

DataLearnerAI blog

No blog post yet

GLM-OCR

API details

API speed

5/5

No public API pricing yet.

GLM-OCR

Benchmark Results

No benchmark data to show.

Compare with other models

No curated comparisons for this model yet.

Want a custom combination? Open the compare tool

GLM-OCR

Publisher

智谱AI

View publisher details

GLM-OCR

Model Overview

GLM-OCR is an AI model published by 智谱AI, released on 2026-02-03, for Vision model, with 900M parameters, and 8K context length, requiring about 1.8GB storage, under the Apache 2.0 license.

DataLearner on WeChat

Follow DataLearner on WeChat for AI model updates and research notes.