
Qwen2.5-Max Benchmark Details

The highest-scoring benchmark results currently listed for Qwen2.5-Max are GSM8K (score 94.50, rank 9 of 26), MMLU (score 87.90, rank 21 of 65), and MBPP (score 80.60, rank 10 of 28).

Benchmark Results


General evaluation (2 evaluations)

| Benchmark | Mode | Score | Rank / total |
| --- | --- | --- | --- |
| MMLU | Standard Mode | 87.90 | 21 / 65 |
| MMLU Pro | Standard Mode | 76.10 | 74 / 124 |

Mathematical reasoning (3 evaluations)

| Benchmark | Mode | Score | Rank / total |
| --- | --- | --- | --- |
| GSM8K | Standard Mode | 94.50 | 9 / 26 |
| MATH | Standard Mode | 68.50 | 24 / 42 |
| FrontierMath | Standard Mode | 1 | 52 / 60 |

Coding and software engineering (2 evaluations)

| Benchmark | Mode | Score | Rank / total |
| --- | --- | --- | --- |
| MBPP | Standard Mode | 80.60 | 10 / 28 |
| HumanEval | Standard Mode | 73.20 | 26 / 39 |
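
Because each benchmark ranks a different number of models, raw ranks such as 21 / 65 and 9 / 26 are hard to compare directly. The sketch below is one possible way to normalize them, dividing rank by total so that lower values mean a better relative position. The scores and ranks are copied from the results listed above; the names `RESULTS`, `rank_fraction`, and `by_relative_rank` are illustrative, and this is not necessarily how the site itself orders its results.

```python
# (benchmark, score, rank, total), copied from the results listed above.
# FrontierMath is listed with a score of "1"; its scale is not stated in the source.
RESULTS = [
    ("MMLU", 87.90, 21, 65),
    ("MMLU Pro", 76.10, 74, 124),
    ("GSM8K", 94.50, 9, 26),
    ("MATH", 68.50, 24, 42),
    ("FrontierMath", 1.0, 52, 60),
    ("MBPP", 80.60, 10, 28),
    ("HumanEval", 73.20, 26, 39),
]


def rank_fraction(rank: int, total: int) -> float:
    """Return rank / total: the share of ranked models at or above this position."""
    return rank / total


def by_relative_rank(results):
    """Sort results from best to worst relative rank (smallest fraction first)."""
    return sorted(results, key=lambda r: rank_fraction(r[2], r[3]))


for name, score, rank, total in by_relative_rank(RESULTS):
    frac = rank_fraction(rank, total)
    print(f"{name:<13} score={score:>6.2f}  rank={rank}/{total}  top {frac:.0%}")
```

Under this normalization, the three best relative ranks belong to MMLU, GSM8K, and MBPP, the same three benchmarks that have the highest raw scores.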