Grok 4 Heavy

Chat LLM

Release date: 2025-07-10
Updated: 2025-07-10 12:38:35
Parameters: Not disclosed
Context length: 128K
Chinese support: Supported
Reasoning ability

Grok 4 Heavy is a chat large language model published by xAI and released on 2025-07-10. Its parameter count is not disclosed, its context length is 128K tokens, and it is closed source.

Data is sourced primarily from official releases (GitHub, Hugging Face, papers), then from benchmark leaderboards, then from third-party evaluators.
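The sourcing order above can be read as a simple fallback rule: use the official value when one exists, otherwise the leaderboard value, otherwise the third-party value. The sketch below illustrates that reading only. The tier names, the `pick_value` helper, and the sample values are hypothetical and are not taken from DataLearner's actual pipeline.

```python
from typing import Optional

# Hypothetical tier names, listed in the priority order the page describes.
SOURCE_PRIORITY = ["official", "leaderboard", "third_party"]


def pick_value(candidates: dict[str, Optional[str]]) -> Optional[str]:
    """Return the value reported by the highest-priority source that has one."""
    for source in SOURCE_PRIORITY:
        value = candidates.get(source)
        if value is not None:
            return value
    return None  # no source reported this field


# Illustrative placeholder values, not real data about any model.
example = {"official": None, "leaderboard": "value-B", "third_party": "value-C"}
chosen = pick_value(example)
print(chosen)
```

With no official value present, the leaderboard value wins over the third-party one, which matches the stated order of preference.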

Model basics

Reasoning traces: Supported
Thinking modes: Not supported
Context length: 128K tokens
Max output length: 8192 tokens
Model type: Chat LLM
Release date: 2025-07-10
Model file size: No data
MoE architecture: No
Total params / Active params: Not disclosed / N/A
Knowledge cutoff: No data

Open source & experience

Code license: Not open source
Weights license: Not open source
GitHub repo: Unavailable
Hugging Face: Unavailable
Live demo: None

Official resources

Paper: No paper available
DataLearnerAI blog: No blog post yet

API details

API speed: 2/5
API pricing: No public pricing yet

Benchmark Results

Grok 4 Heavy's strongest listed results are AIME2025 (score 100, rank 1 of 107), GPQA Diamond (score 88.90, rank 17 of 166), and HLE (score 44.40, rank 22 of 128). This page also lists core specs and context limits, so the benchmark results can be weighed against deployment constraints; no public API pricing is available yet.
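Because each leaderboard has a different number of entries, raw ranks are hard to compare directly. A minimal sketch, assuming rank divided by total is an acceptable way to normalize, converts each rank into the share of the leaderboard at or above it. The `top_share` helper is an illustrative name, not part of any DataLearner tool.

```python
# Ranks and leaderboard sizes copied from the benchmark summary.
results = {
    "AIME2025": (1, 107),
    "GPQA Diamond": (17, 166),
    "HLE": (22, 128),
}


def top_share(rank: int, total: int) -> float:
    """Fraction of the leaderboard at or above this rank (smaller is better)."""
    if not 1 <= rank <= total:
        raise ValueError("rank must be between 1 and total")
    return rank / total


for name, (rank, total) in results.items():
    print(f"{name}: rank {rank} of {total}, top {top_share(rank, total):.1%}")
```

On this measure the AIME2025 result sits in roughly the top 1% of listed entries, GPQA Diamond in about the top 10%, and HLE in about the top 17%.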


Comprehensive evaluation (2 evaluations)

Benchmark        Mode        Score    Rank / total
GPQA Diamond     On          88.90    17 / 166
HLE              On, Tools   44.40    22 / 128

Coding and software engineering (1 evaluation)

Benchmark            Mode        Score    Rank / total
SWE-bench Verified   On, Tools   73.50    30 / 95

Mathematical reasoning (1 evaluation)

Benchmark   Mode   Score   Rank / total
AIME2025    On     100     1 / 107

Publisher

xAI

Model Overview

A large model released by xAI that scored 100% on AIME 2025.