LMArena Math Arena 数学推理能力排行榜
基于 LMArena Math Arena 用户匿名投票的最新AI大模型数学推理能力排行榜,涵盖各模型的 Elo 得分、95% 置信区间、投票量、机构与许可证。
Top Model
-
Top Score
-
Model Count
0
Data version
暂无数据
Data source: LM Arena
About This Leaderboard
This leaderboard ranks AI models by mathematical reasoning ability. Data comes from LMArena's Math sub-track, evaluated through anonymous blind testing by real users on math problem-solving tasks.
Methodology Overview
Blind testing: Users submit math problems, two anonymous models provide solutions, and users vote for the better answer — eliminating brand bias.
Elo scoring: Uses the Bradley-Terry model to calculate Elo scores. Higher scores mean users more frequently prefer that model's math solutions.
Broad scenario coverage: Testing spans algebra, geometry, calculus, competition math, and more diverse real-world math tasks.
DataLearner provides in-depth analysis on top of the raw data, linking leaderboard models to the DataLearner model database so you can quickly access model details, API pricing, benchmark scores, and more.
Filters
Ranking Table
| Rank | Model | Score | 95% CI | Votes | Organization | License |
|---|
Data is for reference only. Official sources are authoritative. Click model names to view DataLearner model profiles.