DataLearner logoDataLearnerAI
Latest AI Insights
Model Leaderboards
Benchmarks
Model Directory
Model Comparison
Resource Center
Tools
LanguageEnglish
DataLearner logoDataLearner AI

A knowledge platform focused on LLM benchmarking, datasets, and practical instruction with continuously updated capability maps.

Products

  • Leaderboards
  • Model comparison
  • Datasets

Resources

  • Tutorials
  • Editorial
  • Tool directory

Company

  • About
  • Privacy policy
  • Data methodology
  • Contact

© 2026 DataLearner AI. DataLearner curates industry data and case studies so researchers, enterprises, and developers can rely on trustworthy intelligence.

Privacy policyTerms of service
  1. Home
  2. /
  3. Benchmarks
  4. /
  5. Simple Bench

Simple Bench

Updated Jun 23, 2026·1,397 views
Current SOTA
Anthropic
Claude Fable 5
Anthropic
81.90Score
Problem Count
200
Institution
—
Category
Math and Reasoning
Metrics
Accuracy
Language
English
Difficulty
Mixed

Overview

Simple Bench is an AI benchmark used to evaluate model capabilities. Review its overview, metrics, official resources, and model leaderboard results on DataLearnerAI.

Related resources

  • View Paper
  • Get Dataset
  • Official Website
  • DataLearner Blog

Latest Simple Bench model rankings and full benchmark leaderboard

Browse the latest scores, model modes, release dates, and parameter sizes for Simple Bench.

Source: DataLearnerAI

Data sourced primarily from official releases (GitHub, Hugging Face, papers), then benchmark leaderboards, then third-party evaluators. Learn about our data methodology

Model Mode Legend
License:
Origin:
Model release cutoff:

Simple Bench Rank

RankModelLicense
Anthropic
Claude Fable 5
Standard Mode
81.90
2026-06-09UnknownClosed
Google Deep Mind
Gemini 3.1 Pro Preview
Standard Mode
79.60
2026-02-20UnknownClosed
OpenAI
GPT-5.5 Pro
Standard Mode
76.90
2026-04-23UnknownClosed
4
Google Deep Mind
Gemini 3.5 Flash
Standard Mode
76.70
2026-06-20UnknownClosed
5
Google Deep Mind
Gemini 3.0 Pro (Preview 11-2025)
Thinking Enabled
76.40
2025-11-18UnknownClosed
6
阿里巴巴
Qwen3.7 Max
Standard Mode
70.40
2026-03-01UnknownClosed
7
OpenAI
GPT-5.5
Standard Mode
69.00
2026-04-23UnknownClosed
8
Anthropic
Claude Opus 4.6
Standard Mode
67.60
2026-02-05UnknownClosed
9
Anthropic
Claude Opus 4.8
Standard Mode
64.80
2026-05-28UnknownClosed
10
阿里巴巴
Qwen3.6-Max-Preview
Standard Mode
63.00
2026-04-201000BClosed
11
Google Deep Mind
Gemini 2.5-Pro
Thinking Enabled
62.40
2025-06-05UnknownClosed
12
Anthropic
Opus 4.5
Extended Thinking
62.00
2025-11-25UnknownClosed
13
Anthropic
Opus 4.7
Standard Mode
61.70
2026-04-16UnknownClosed
14
OpenAI
GPT-5-Pro
Thinking Enabled
61.60
2025-08-07UnknownClosed
15
xAI
Grok 4
Thinking Enabled
60.50
2025-07-10UnknownClosed
16
Anthropic
Opus 4.1
Extended Thinking
60.00
2025-08-06UnknownClosed
17
Anthropic
Claude Opus 4
Thinking Enabled
58.80
2025-05-23UnknownClosed
18
智谱AI
GLM-5.2
Standard Mode
58.80
2026-06-13753.3BFree Commercial
19
OpenAI
GPT-5.2 Pro
Thinking Level · Extra High
57.40
2025-12-11UnknownClosed
20
OpenAI
GPT-5
Thinking Level · High
56.70
2025-08-07UnknownClosed
21
xAI
Grok 4.1 Fast
Standard Mode
56.00
2025-11-19UnknownClosed
22
Anthropic
Claude Sonnet 4.5
Standard Mode
54.30
2025-09-30UnknownClosed
23
OpenAI
GPT-5.1
Thinking Level · High
53.20
2025-11-12UnknownClosed
24
智谱AI
GLM-5
Standard Mode
53.20
2026-02-11744BFree Commercial
25
OpenAI
OpenAI o3
Thinking Level · High
53.10
2025-04-16UnknownClosed
26
DeepSeek-AI
DeepSeek V3.2 Speciale
Standard Mode
52.60
2025-12-01UnknownFree Commercial
27
Google Deep Mind
Gemini 2.5 Pro Experimental 03-25
Standard Mode
51.60
2025-03-25UnknownClosed
28
DeepSeek-AI
DeepSeek-V4-Pro
Standard Mode
50.90
2026-04-241600BFree Commercial
29
智谱AI
GLM-4.7
Thinking Enabled
47.70
2025-12-22358BFree Commercial
30
Moonshot AI
Kimi K2.5
Thinking Enabled
46.80
2026-01-271000BFree Commercial
Scroll to load 33 more