Grok 4 Heavy
Grok 4 Heavy is an AI model published by xAI, released on 2025-07-10, for AI model, and 128K tokens context length, with no open-source license.
Data sourced primarily from official releases (GitHub, Hugging Face, papers), then benchmark leaderboards, then third-party evaluators. Learn about our data methodology
Grok 4 Heavy currently shows benchmark results led by AIME2025 (1 / 106, score 100), GPQA Diamond (22 / 175, score 88.90), HLE (31 / 149, score 44.40). This page also consolidates core specs, context limits, and API pricing so you can evaluate the model from benchmark results and deployment constraints together.
Grok 4 Heavy is an AI model published by xAI, released on 2025-07-10, for AI model, and 128K tokens context length, with no open-source license.
Follow DataLearner on WeChat for AI model updates and research notes.

No curated comparisons for this model yet.
Want a custom combination? Open the compare tool