DataLearner logoDataLearnerAI
Latest AI Insights
Model Leaderboards
Benchmarks
Model Directory
Model Comparison
Resource Center
Tools
LanguageEnglish
DataLearner logoDataLearner AI

A knowledge platform focused on LLM benchmarking, datasets, and practical instruction with continuously updated capability maps.

Products

  • Leaderboards
  • Model comparison
  • Datasets

Resources

  • Tutorials
  • Editorial
  • Tool directory

Company

  • About
  • Privacy policy
  • Data methodology
  • Contact

© 2026 DataLearner AI. DataLearner curates industry data and case studies so researchers, enterprises, and developers can rely on trustworthy intelligence.

Privacy policyTerms of service
  1. Home
  2. /
  3. Benchmarks
  4. /
  5. IMO-AnswerBench

IMO-AnswerBench

Updated Apr 24, 2026·1,144 views
Current SOTA
DeepSeek-AI
DeepSeek-V4-Pro
DeepSeek-AI
89.80Score
Problem Count
400
Institution
DeepMind
Category
Math and Reasoning
Metrics
Accuracy
Language
English
Difficulty
Mixed

Overview

IMO-AnswerBench is an AI benchmark used to evaluate model capabilities. Review its overview, metrics, official resources, and model leaderboard results on DataLearnerAI.

Related resources

  • View Paper
  • Get Dataset
  • Official Website
  • DataLearner Blog

Latest IMO-AnswerBench model rankings and full benchmark leaderboard

Browse the latest scores, model modes, release dates, and parameter sizes for IMO-AnswerBench.

Source: DataLearnerAI

Data sourced primarily from official releases (GitHub, Hugging Face, papers), then benchmark leaderboards, then third-party evaluators. Learn about our data methodology

Model Mode Legend
License:
Origin:
Model release cutoff:

IMO-AnswerBench Rank

RankModelLicense
DeepSeek-AI
DeepSeek-V4-Pro
Thinking Level · High
89.80
2026-04-241600BFree Commercial
DeepSeek-AI
DeepSeek-V4-Flash
Thinking Level · High
88.40
2026-04-24284BFree Commercial
DeepSeek-AI
DeepSeek-V4-Pro
Thinking Level · High
88.00
2026-04-241600BFree Commercial
4
StepFunAI
Step 3.5 Flash
Thinking EnabledTools
86.70
2026-02-02196BFree Commercial
5
Moonshot AI
Kimi K2.6
Thinking Enabled
86.00
2026-04-201000BFree Commercial
6
StepFunAI
Step 3.5 Flash
Thinking Enabled
85.40
2026-02-02196BFree Commercial
7
DeepSeek-AI
DeepSeek-V4-Flash
Thinking Level · High
85.10
2026-04-24284BFree Commercial
8
阿里巴巴
Qwen3-Max-Thinking
Thinking Enabled
83.90
2026-01-261000BClosed
9
智谱AI
GLM 5.1
Thinking Enabled
83.80
2026-03-2775.4BFree Commercial
10
阿里巴巴
Qwen 3.6 Plus Preview
Thinking Enabled
83.80
2026-03-31UnknownClosed
11
智谱AI
GLM-5
Thinking Enabled
82.50
2026-02-11744BFree Commercial
12
Moonshot AI
Kimi K2.5
Thinking Enabled
81.80
2026-01-271000BFree Commercial
13
阿里巴巴
Qwen3.5-397B-A17B
Thinking Enabled
80.90
2026-02-1639.7BFree Commercial
14
阿里巴巴
Qwen3.6-27B
Thinking Enabled
80.80
2026-04-2227BFree Commercial
15
阿里巴巴
Qwen3.6-35B-A3B
Thinking Enabled
78.90
2026-04-1635BFree Commercial
16
DeepSeek-AI
DeepSeek-V4-Flash
Standard Mode
41.90
2026-04-24284BFree Commercial
17
DeepSeek-AI
DeepSeek-V4-Pro
Standard Mode
35.30
2026-04-241600BFree Commercial