DataLearner logoDataLearnerAI
Latest AI Insights
Model Leaderboards
Benchmarks
Model Directory
Model Comparison
Resource Center
Tools
LanguageEnglish
DataLearner logoDataLearner AI

A knowledge platform focused on LLM benchmarking, datasets, and practical instruction with continuously updated capability maps.

Products

  • Leaderboards
  • Model comparison
  • Datasets

Resources

  • Tutorials
  • Editorial
  • Tool directory

Company

  • About
  • Privacy policy
  • Data methodology
  • Contact

© 2026 DataLearner AI. DataLearner curates industry data and case studies so researchers, enterprises, and developers can rely on trustworthy intelligence.

Privacy policyTerms of service
  1. Home
  2. /
  3. Benchmarks
  4. /
  5. MMLU Pro

MMLU Pro

Updated Apr 24, 2026·3,968 views
Current SOTA
OpenAI
OpenAI o1
OpenAI
91.04Score
Problem Count
38500
Institution
Berkeley Artificial Intelligence Research
Category
General Evaluation
Metrics
Accuracy
Language
English
Difficulty
Medium

Overview

MMLU Pro is an AI benchmark used to evaluate model capabilities. Review its overview, metrics, official resources, and model leaderboard results on DataLearnerAI.

Related resources

  • View Paper
  • Get Dataset
  • Official Website
  • DataLearner Blog

Latest MMLU Pro model rankings and full benchmark leaderboard

Browse the latest scores, model modes, release dates, and parameter sizes for MMLU Pro.

Source: DataLearnerAI

Data sourced primarily from official releases (GitHub, Hugging Face, papers), then benchmark leaderboards, then third-party evaluators. Learn about our data methodology

Model Mode Legend
License:
Origin:
Model release cutoff:

MMLU Pro Rank

RankModelLicense
OpenAI
OpenAI o1
Standard Mode
91.04
2024-12-05UnknownClosed
Google Deep Mind
Gemini 3.0 Pro (Preview 11-2025)
Thinking Enabled
90.00
2025-11-18UnknownClosed
Anthropic
Opus 4.5
Extended Thinking
90.00
2025-11-25UnknownClosed
4
阿里巴巴
Qwen 3.6 Plus Preview
Thinking Enabled
88.50
2026-03-31UnknownClosed
5
Anthropic
Opus 4.1
Extended Thinking
88.00
2025-08-06UnknownClosed
6
Anthropic
Claude Sonnet 4.5
Thinking Enabled
88.00
2025-09-30UnknownClosed
7
MiniMaxAI
M2.1
Thinking Enabled
88.00
2025-12-23230BFree Commercial
8
阿里巴巴
Qwen3.5-397B-A17B
Thinking Enabled
87.80
2026-02-1639.7BFree Commercial
9
DeepSeek-AI
DeepSeek-V4-Pro
Thinking Level · High
87.50
2026-04-241600BFree Commercial
10
腾讯AI实验室
Hunyuan-T1
Standard Mode
87.20
2025-03-21UnknownClosed
11
DeepSeek-AI
DeepSeek-V4-Pro
Thinking Level · High
87.10
2026-04-241600BFree Commercial
12
xAI
Grok 4
Thinking Enabled
87.00
2025-07-10UnknownClosed
13
DeepSeek-AI
DeepSeek-V4-Flash
Thinking Level · High
86.40
2026-04-24284BFree Commercial
14
阿里巴巴
Qwen3.6-27B
Thinking Enabled
86.20
2026-04-2227BFree Commercial
15
DeepSeek-AI
DeepSeek-V4-Flash
Thinking Level · High
86.20
2026-04-24284BFree Commercial
16
OpenAI
GPT-4.5
Standard Mode
86.10
2025-02-28UnknownClosed
17
阿里巴巴
Qwen3.5-27B
Thinking Enabled
86.10
2026-02-2527BFree Commercial
18
Google Deep Mind
Gemini 2.5-Pro
Standard Mode
86.00
2025-06-05UnknownClosed
19
阿里巴巴
Qwen3-Max-Thinking
Thinking Enabled
85.70
2026-01-261000BClosed
20
OpenAI
OpenAI o3
Standard Mode
85.60
2025-04-16UnknownClosed
21
DeepMind
Gemma 4 31B
Thinking Enabled
85.20
2026-04-023.1BFree Commercial
22
阿里巴巴
Qwen3.6-35B-A3B
Thinking Enabled
85.20
2026-04-1635BFree Commercial
23
Anthropic
Claude Opus 4
Standard Mode
85.00
2025-05-23UnknownClosed
24
DeepSeek-AI
DeepSeek-R1-0528
Thinking Enabled
85.00
2025-05-28671BFree Commercial
25
DeepSeek-AI
DeepSeek-V3.1
Thinking Enabled
85.00
2025-08-20671BFree Commercial
26
DeepSeek-AI
DeepSeek-V3.1 Terminus
Thinking Enabled
85.00
2025-09-22671BFree Commercial
27
DeepSeek-AI
DeepSeek-V3.1 Terminus
Standard Mode
85.00
2025-09-22671BFree Commercial
28
DeepSeek-AI
DeepSeek V3.2-Exp
Thinking Enabled
85.00
2025-09-29671BFree Commercial
29
xAI
Grok 4.1 Fast
Thinking Enabled
85.00
2025-11-19UnknownClosed
30
智谱AI
GLM-4.5
Thinking Enabled
84.60
2025-07-28355BFree Commercial
Scroll to load 94 more