DataLearner logoDataLearnerAI
Latest AI Insights
Model Leaderboards
Benchmarks
Model Directory
Model Comparison
Resource Center
Tools
LanguageEnglish
DataLearner logoDataLearner AI

A knowledge platform focused on LLM benchmarking, datasets, and practical instruction with continuously updated capability maps.

Products

  • Leaderboards
  • Model comparison
  • Datasets

Resources

  • Tutorials
  • Editorial
  • Tool directory

Company

  • About
  • Privacy policy
  • Data methodology
  • Contact

© 2026 DataLearner AI. DataLearner curates industry data and case studies so researchers, enterprises, and developers can rely on trustworthy intelligence.

Privacy policyTerms of service
Page navigation
Page navigation
Model catalogQwen3-235B-A22B-ThinkingBenchmark analysis

Qwen3-235B-A22B-Thinking Benchmark Details

Qwen3-235B-A22B-Thinking currently shows benchmark results led by Creative Writing (5 / 23, score 86.10), MMLU Pro (32 / 124, score 84.40), AIME2025 (33 / 106, score 92.30).

Benchmark Results

Qwen3-235B-A22B-Thinking

Benchmark Results

Thinking

综合评估

4 evaluations
Benchmark / mode
Score
Rank/total
MMLU Pro
Thinking Enabled
84.40
32 / 124
GPQA Diamond
Thinking Enabled
81.10
64 / 175
LiveBench
Thinking Enabled
63.42
39 / 52
HLE
Thinking Enabled
18.20
101 / 149

编程与软件工程

1 evaluations
Benchmark / mode
Score
Rank/total
LiveCodeBench
Thinking Enabled
74.10
39 / 118

数学推理

3 evaluations
Benchmark / mode
Score
Rank/total
AIME2025
Thinking Enabled
92.30
33 / 106
IMO-ProofBench
Thinking Enabled
33.30
6 / 16
IMO-ProofBench Advanced
Thinking Enabled
5.20
5 / 8

写作和创作

1 evaluations
Benchmark / mode
Score
Rank/total
Creative Writing
Thinking Enabled
86.10
5 / 23
Compare with other models