DataLearner 标志DataLearnerAI
最新AI资讯
大模型排行榜
大模型评测基准
大模型列表
大模型对比
资源中心
工具
语言中文
DataLearner 标志DataLearner AI

专注大模型评测、数据资源与实践教学的知识平台,持续更新可落地的 AI 能力图谱。

产品

  • 评测榜单
  • 模型对比
  • 数据资源

资源

  • 部署教程
  • 原创内容
  • 工具导航

关于

  • 关于我们
  • 隐私政策
  • 数据收集方法
  • 联系我们

© 2026 DataLearner AI. DataLearner 持续整合行业数据与案例,为科研、企业与开发者提供可靠的大模型情报与实践指南。

隐私政策服务条款
首页综合排行榜LMArena Math Arena 数学推理能力排行榜

LMArena 评测赛道

文本生成代码数学图像编辑文字生成视频图生视频文生图

LMArena Math Arena 数学推理能力排行榜

基于 LMArena Math Arena 用户匿名投票的最新AI大模型数学推理能力排行榜,涵盖各模型的 Elo 得分、95% 置信区间、投票量、机构与许可证。

榜首模型

Gemini 3.5 Flash

最高得分

1519.00

模型数量

354

数据版本

2026年06月05日

数据来源: LM Arena

关于本排行榜

本排行榜展示了当前 AI 大模型在数学推理任务中的实力排名。数据来源于 LMArena 的 Math 子赛道,通过真实用户匿名盲测投票评估各模型在数学解题任务中的表现。

评测方法概要

匿名盲测:用户提出数学题目后,由两个"隐藏身份"的模型分别作答,用户投票选出解题更优的一方,排除品牌偏见。

Elo 评分:采用 Bradley-Terry 模型计算 Elo 分数,分数越高说明该模型在数学场景中被用户更频繁地选择。

来源:全部国产模型
榜单历史快照月份:

排名总表

排名模型名称得分95% CI投票数机构许可证
Google Deep MindGemini 3.5 FlashGoogle Deep Mind

数据仅供参考,以官方来源为准。模型名称旁的链接可跳转到 DataLearner 模型详情页。

常见问题 (FAQ)

01

什么是 LMArena Math Arena?

LMArena Math Arena 是 LMArena 旗下专注于数学推理能力的匿名评测平台。用户提交真实数学问题(如代数、几何、竞赛数学等),系统将不同模型的解题过程并排展示(隐藏模型名称),由用户投票选出更好的解答,最终通过 Elo 算法汇总形成动态排行榜。

02

Math Arena 与 MATH-500、AIME 等静态基准有什么区别?

MATH-500、AIME、AMC 等静态基准使用固定题目集和自动评分,可重现性强但容易被针对性优化("刷榜")。Math Arena 来自真实用户的开放式数学问题,测试内容不固定,更能反映模型在实际数学场景中的自然表现,两者互为补充。

03

思考模型(Thinking Model)在数学 Arena 中表现更好吗?

整体而言,具备思维链(Chain-of-Thought)或扩展推理能力的模型在数学 Arena 中往往排名更高。Claude Opus 系列 Thinking 模式、GPT 高算力模式以及 DeepSeek 思考版本均在榜单前列,说明延长推理时间对数学问题的解答质量有显著提升。

04

国产大模型在数学能力方面表现如何?

DeepSeek、Qwen3 系列、GLM 等国产模型在 Math Arena 表现亮眼,已跻身全球前列。DeepSeek 以 MIT 协议开源,Qwen3-235B 等系列支持中文数学场景,是选择开源数学推理模型的重要参考。

覆盖多种数学场景:包括代数、几何、计算推理、竞赛数学等多元化的真实数学任务。

DataLearner 在原始数据基础上提供中文解读与深度分析,并将排行榜模型关联至 DataLearner 模型库,方便您一键查看模型详情、API 定价、评测得分等完整信息。

1519.00
+/-26
558
Google Deep Mind
Proprietary
AnthropicClaude Opus 4.6 (thinking)Anthropic1514.00+/-132,168AnthropicProprietary
OpenAIGPT-5.4 (high)OpenAI1511.00+/-141,911OpenAIProprietary
4AnthropicClaude Opus 4.6Anthropic1506.00+/-122,455AnthropicProprietary
5Google Deep MindGemini 3.1 Pro PreviewGoogle Deep Mind1499.00+/-112,911Google Deep MindProprietary
6AnthropicOpus 4.7 (thinking)Anthropic1495.00+/-161,395AnthropicProprietary
7阿里Qwen3.7-Max-Preview阿里巴巴1494.00+/-40220阿里巴巴Proprietary
8AnthropicOpus 4.7Anthropic1494.00+/-161,423AnthropicProprietary
9OpenAIGPT-5.5 (high)OpenAI1492.00+/-171,200OpenAIProprietary
10OpenAIGPT-5.5OpenAI1491.00+/-181,200OpenAIProprietary
11Anthropicclaude-opus-4-8Anthropic1488.00+/-31317AnthropicProprietary
12MiniMaxminimax-m3MiniMax1487.00+/-39236MiniMaxProprietary
13Moonshot AIKimi K2.6Moonshot AI1485.00+/-181,062Moonshot AIModified MIT
14XIMiMo V2.5 ProXiaomi1484.00+/-191,035XiaomiMIT
15阿里Qwen3.6-Max-Preview阿里巴巴1482.00+/-30349阿里巴巴Proprietary
16Anthropicclaude-opus-4-8-thinkingAnthropic1481.00+/-33282AnthropicProprietary
17百度ERNIE-5.1-Preview百度1480.00+/-181,033百度Proprietary
18智谱GLM 5.1智谱AI1479.00+/-20915智谱AIMIT
19Google Deep MindGemini 3.0 Pro (Preview 11-2025)Google Deep Mind1478.00+/-112,655Google Deep MindProprietary
20Google Deep MindGemini 3.0 FlashGoogle Deep Mind1476.00+/-132,004Google Deep MindProprietary
21DeepSeek-AIDeepSeek-V4-Pro (thinking)DeepSeek-AI1472.00+/-181,056DeepSeek-AIMIT
22Moonshot AIKimi K2 ThinkingMoonshot AI1472.00+/-122,467Moonshot AIModified MIT
23AnthropicClaude Opus 4 (thinking-32k)Anthropic1470.00+/-122,267AnthropicProprietary
24xAIGrok 4.20 Beta ReasoningxAI1470.00+/-141,980xAIProprietary
25DeepMindGemma 4 31BDeepMind1470.00+/-28398DeepMindApache 2.0
26阿里Qwen3.5 Max Preview阿里巴巴1469.00+/-161,314阿里巴巴Proprietary
27FAMuse SparkFacebook AI研究实验室1468.00+/-20825Facebook AI研究实验室Proprietary
28DeepMindGemma 4 26B A4BDeepMind1467.00+/-28372DeepMindApache 2.0
29AnthropicClaude Opus 4Anthropic1466.00+/-94,280AnthropicProprietary
30OpenAIGPT-5.5 InstantOpenAI1465.00+/-161,460OpenAIProprietary
31AnthropicClaude Sonnet 4.6Anthropic1460.00+/-141,926AnthropicProprietary
32OpenAIGPT-5.2 Pro (high)OpenAI1459.00+/-112,943OpenAIProprietary
33阿里Qwen 3.6 Plus Preview阿里巴巴1457.00+/-161,332阿里巴巴Proprietary
34OpenAIGPT-5.4OpenAI1457.00+/-131,994OpenAIProprietary
35Google Deep MindGemini 3.0 Flash (minimal)Google Deep Mind1456.00+/-103,427Google Deep MindProprietary
36OpenAIGPT-5.1 Pro (high)OpenAI1455.00+/-122,499OpenAIProprietary
37AnthropicClaude Sonnet 4.5 (thinking-32k)Anthropic1455.00+/-94,862AnthropicProprietary
38OpenAIGPT-5.2OpenAI1454.00+/-132,038OpenAIProprietary
39XIMiMo V2 ProXiaomi1454.00+/-151,585XiaomiProprietary
40xAIGrok 4.20 BetaxAI1452.00+/-151,563xAIProprietary
41xAIGrok 4.20 Multi-AgentxAI1452.00+/-141,958xAIProprietary
42字节DOLA Seed 2.0 Pro字节跳动Seed团队1450.00+/-122,513字节跳动Seed团队Proprietary
43XImimo-v2-omniXiaomi1450.00+/-35289XiaomiProprietary
44阿里Qwen3.5-397B-A17B阿里巴巴1449.00+/-122,267阿里巴巴Apache 2.0
45OpenAIOpenAI o3OpenAI1448.00+/-103,730OpenAIProprietary
46Mistralmistral-medium-3.5Mistral1445.00+/-31314MistralModified MIT
47xAIGrok 4.1 ThinkingxAI1444.00+/-103,789xAIProprietary
48AnthropicOpus 4.1 (thinking-16k)Anthropic1443.00+/-113,026AnthropicProprietary
49Moonshot AIKimi K2.5 InstantMoonshot AI1442.00+/-25513Moonshot AIModified MIT
50Google Deep MindGemini 2.5 Pro Experimental 03-25Google Deep Mind1442.00+/-77,598Google Deep MindProprietary
51OpenAIGPT-5.4 mini (high)OpenAI1441.00+/-141,837OpenAIProprietary
52XIMiMo V2.5Xiaomi1440.00+/-171,114XiaomiMIT
53Moonshot AIKimi K2 Thinking (thinking-turbo)Moonshot AI1440.00+/-103,746Moonshot AIModified MIT
54DeepSeek-AIDeepSeek-V4-Flash (thinking)DeepSeek-AI1440.00+/-181,151DeepSeek-AIMIT
55Googlegemini-3.1-flash-lite-previewGoogle1439.00+/-122,401GoogleProprietary
56智谱GLM-5智谱AI1439.00+/-161,378智谱AIMIT
57阿里Qwen3 Max (Preview)阿里巴巴1439.00+/-151,525阿里巴巴Proprietary
58DeepSeek-AIDeepSeek-V4-FlashDeepSeek-AI1439.00+/-171,191DeepSeek-AIMIT
59DeepSeek-AIDeepSeek-V4-ProDeepSeek-AI1438.00+/-171,263DeepSeek-AIMIT
60百度ERNIE 5.0百度1438.00+/-132,126百度Proprietary
61OpenAIGPT-5.4 nano (high)OpenAI1437.00+/-151,687OpenAIProprietary
62MeituanLongCat Flash Chat (2602)Meituan1437.00+/-141,704MeituanProprietary
63OpenAIGPT-5.2OpenAI1434.00+/-113,052OpenAIProprietary
64OpenAIGPT-5-Pro (high)OpenAI1434.00+/-141,886OpenAIProprietary
65AnthropicOpus 4.1Anthropic1433.00+/-94,724AnthropicProprietary
66xAIGrok 4.1xAI1430.00+/-94,200xAIProprietary
67阿里Qwen3.5-27B阿里巴巴1430.00+/-151,620阿里巴巴Apache 2.0
68Alibabaqwen3-max-2025-09-23Alibaba1429.00+/-24584AlibabaProprietary
69DeepSeek-AIDeepSeek V3.2DeepSeek-AI1429.00+/-112,980DeepSeek-AIMIT
70智谱GLM-4.7智谱AI1429.00+/-21710智谱AIMIT
71Amazonamazon-nova-experimental-chat-26-02-10Amazon1428.00+/-39207AmazonProprietary
72DeepSeek-AIDeepSeek V3.2-Exp (thinking)DeepSeek-AI1428.00+/-27480DeepSeek-AIMIT
73xAIGrok 4xAI1428.00+/-122,265xAIProprietary
74AnthropicClaude Sonnet 4.5Anthropic1427.00+/-94,867AnthropicProprietary
75Tencenthunyuan-hy3-previewTencent1426.00+/-28390Tencenttencent-hunyuan-community
76DeepSeek-AIDeepSeek V3.2 (thinking)DeepSeek-AI1426.00+/-122,483DeepSeek-AIMIT
77OpenAIGPT-5.3OpenAI1425.00+/-142,003OpenAIProprietary
78xAIGrok 4 FastxAI1424.00+/-29399xAIProprietary
79OpenAIGPT-5.1 InstantOpenAI1424.00+/-112,865OpenAIProprietary
80xAIGrok 4.3 BetaxAI1424.00+/-181,064xAIProprietary
81阿里Qwen3.5-122B-A10B阿里巴巴1422.00+/-141,729阿里巴巴Apache 2.0
82智谱GLM-4.6智谱AI1421.00+/-132,107智谱AIMIT
83AnthropicClaude Opus 4 (thinking-16k)Anthropic1420.00+/-122,240AnthropicProprietary
84阿里Qwen3-235B-A22B-2507阿里巴巴1420.00+/-85,893阿里巴巴Apache 2.0
85阿里Qwen3-Next阿里巴巴1419.00+/-171,212阿里巴巴Apache 2.0
86xAIGrok 4.1 Fast (fast-reasoning)xAI1419.00+/-103,449xAIProprietary
87DeepSeek-AIDeepSeek V3.2-ExpDeepSeek-AI1418.00+/-21775DeepSeek-AIMIT
88MeituanLongCat Flash Chat (2602)Meituan1417.00+/-22688MeituanMIT
89Moonshot AIKimi K2 0905Moonshot AI1416.00+/-21759Moonshot AIModified MIT
90OpenAIOpenAI o4 - miniOpenAI1416.00+/-112,938OpenAIProprietary
91DeepSeek-AIDeepSeek-V3.1DeepSeek-AI1415.00+/-18993DeepSeek-AIMIT
92DeepSeek-AIDeepSeek-V3.1 (thinking)DeepSeek-AI1414.00+/-22663DeepSeek-AIMIT
93智谱GLM-4.5智谱AI1413.00+/-151,425智谱AIMIT
94OpenAIGPT-5OpenAI1413.00+/-141,786OpenAIProprietary
95Google Deep MindGemini 2.5 Flash-Preview-09-2025Google Deep Mind1413.00+/-131,944Google Deep MindProprietary
96MiniMaxAIMiniMax-M2.7MiniMaxAI1413.00+/-151,595MiniMaxAIModified MIT
97xAIGrok 4 Fast (fast-reasoning)xAI1412.00+/-181,085xAIProprietary
98DeepSeek-AIDeepSeek-R1DeepSeek-AI1411.00+/-141,606DeepSeek-AIMIT
99阿里Qwen3-VL-235B-A22B-Instruct阿里巴巴1411.00+/-23704阿里巴巴Apache 2.0
100DeepSeek-AIDeepSeek-V3.1 Terminus (thinking)DeepSeek-AI1410.00+/-40201DeepSeek-AIMIT
101Amazonamazon-nova-experimental-chat-26-01-10Amazon1410.00+/-33263AmazonProprietary
102OpenAIGPT-4.5OpenAI1409.00+/-151,393OpenAIProprietary
103OpenAIOpenAI o1OpenAI1409.00+/-112,986OpenAIProprietary
104百度ERNIE 5.0百度1408.00+/-23619百度Proprietary
105StepFunAIStep 3.5 FlashStepFunAI1408.00+/-122,317StepFunAIApache 2.0
106Google Deep MindGemini 2.5 FlashGoogle Deep Mind1406.00+/-77,837Google Deep MindProprietary
107OpenAIGPT-5-mini (high)OpenAI1406.00+/-151,460OpenAIProprietary
108OpenAIOpenAI o3-mini (high)OpenAI1406.00+/-131,909OpenAIProprietary
109阿里Qwen3-VL-235B-A22B-Instruct (thinking)阿里巴巴1405.00+/-28428阿里巴巴Apache 2.0
110OpenAIGPT-4o(2025-03-27)OpenAI1404.00+/-85,726OpenAIProprietary
111StepFunAIStep 3.5 FlashStepFunAI1403.00+/-132,055StepFunAIProprietary
112AnthropicClaude Opus 4Anthropic1403.00+/-112,769AnthropicProprietary
113AnthropicClaude Sonnet 4 (thinking-32k)Anthropic1403.00+/-132,023AnthropicProprietary
114腾讯Hunyuan-T1腾讯AI实验室1401.00+/-38236腾讯AI实验室Proprietary
115MistralAIMistral Large 3MistralAI1401.00+/-112,766MistralAIApache 2.0
116阿里Qwen3.5-35B-A3B阿里巴巴1401.00+/-141,721阿里巴巴Apache 2.0
117Amazonamazon-nova-experimental-chat-12-10Amazon1400.00+/-37234AmazonProprietary
118百度ERNIE 5.0百度1400.00+/-34268百度Proprietary
119MistralAIMagistral-Medium-2506MistralAI1399.00+/-85,780MistralAIProprietary
120阿里Qwen3-32B阿里巴巴1399.00+/-30316阿里巴巴Apache 2.0
121Amazonamazon-nova-experimental-chat-11-10Amazon1398.00+/-151,584AmazonProprietary
122Alibabaqwen3-235b-a22b-thinking-2507Alibaba1398.00+/-24489AlibabaApache 2.0
123AnthropicHaiku 4.5Anthropic1397.00+/-94,985AnthropicProprietary
124DeepSeek-AIDeepSeek-R1-0528DeepSeek-AI1396.00+/-20869DeepSeek-AIMIT
125MiniMaxAIMiniMax M2.5MiniMaxAI1396.00+/-122,387MiniMaxAIModified MIT
126DeepSeek-AIDeepSeek-V3.1 TerminusDeepSeek-AI1395.00+/-39219DeepSeek-AIMIT
127Amazonamazon-nova-experimental-chat-10-20Amazon1395.00+/-20806AmazonProprietary
128Alibabaqwen3-235b-a22b-no-thinkingAlibaba1394.00+/-122,390AlibabaApache 2.0
129阿里Qwen3-235B-A22B阿里巴巴1393.00+/-141,604阿里巴巴Apache 2.0
130MiniMaxAIM2.1MiniMaxAI1393.00+/-181,010MiniMaxAIMIT
131智谱GLM-4.5-Air智谱AI1390.00+/-151,540智谱AIMIT
132Nvidianvidia-llama-3.3-nemotron-super-49b-v1.5Nvidia1390.00+/-39194NvidiaNvidia Open
133阿里Qwen3-Next (thinking)阿里巴巴1389.00+/-20829阿里巴巴Apache 2.0
134Moonshot AIKimi K2Moonshot AI1388.00+/-141,693Moonshot AIModified MIT
135OpenAIOpenAI o3-mini (high)OpenAI1388.00+/-18977OpenAIProprietary
136AnthropicClaude Sonnet 4Anthropic1388.00+/-122,475AnthropicProprietary
137ARtrinity-large-thinkingArcee AI1388.00+/-161,551Arcee AIApache 2.0
138OpenAIOpenAI o1OpenAI1386.00+/-104,569OpenAIProprietary
139AnthropicClaude Sonnet 3.7 (thinking-32k)Anthropic1384.00+/-112,793AnthropicProprietary
140PRintellect-3Prime Intellect1383.00+/-31333Prime IntellectMIT
141OpenAIGPT OSS 120BOpenAI1382.00+/-141,793OpenAIApache 2.0
142OpenAIOpenAI o3-miniOpenAI1382.00+/-84,722OpenAIProprietary
143阿里Qwen3-30B-A3B-2507阿里巴巴1381.00+/-151,426阿里巴巴Apache 2.0
144XImimo-v2-flash (non-thinking)Xiaomi1380.00+/-112,793XiaomiMIT
145Nvidiallama-3.1-nemotron-ultra-253b-v1Nvidia1380.00+/-37209NvidiaNvidia Open Model
146阿里Qwen3-Coder-480B-A35B阿里巴巴1377.00+/-151,627阿里巴巴Apache 2.0
147Nvidianvidia-nemotron-3-super-120b-a12bNvidia1375.00+/-25515NvidiaNVIDIA Open Model
148xAIGrok 3xAI1375.00+/-112,677xAIProprietary
149XImimo-v2-flash (thinking)Xiaomi1374.00+/-22633XiaomiMIT
150OpenAIGPT-4.1OpenAI1374.00+/-103,227OpenAIProprietary
151MiniMaxminimax-m1MiniMax1371.00+/-131,797MiniMaxApache 2.0
152DeepSeek-AIDeepSeek-V3-0324DeepSeek-AI1370.00+/-103,191DeepSeek-AIMIT
153xAIgrok-3-mini-betaxAI1369.00+/-141,528xAIProprietary
154智谱GLM-4.7-Flash智谱AI1366.00+/-21718智谱AIMIT
155Google Deep MindGemini 2.5 Flash-Lite (thinking)Google Deep Mind1365.00+/-122,095Google Deep MindProprietary
156Google Deep MindGemini 2.5 Flash-Lite-Preview-09-2025 (no-thinking)Google Deep Mind1365.00+/-112,877Google Deep MindProprietary
157阿里QwQ-32B阿里巴巴1364.00+/-141,720阿里巴巴Apache 2.0
158阿里Qwen2.5-Max阿里巴巴1364.00+/-103,305阿里巴巴Proprietary
159StepFunAIStep3StepFunAI1364.00+/-31352StepFunAIApache 2.0
160ARtrinity-large-previewArcee AI1362.00+/-141,857Arcee AIApache 2.0
161AnthropicClaude Sonnet 3.7Anthropic1362.00+/-103,358AnthropicProprietary
162OpenAIOpenAI o1-miniOpenAI1362.00+/-87,499OpenAIProprietary
163智谱GLM-4.5V智谱AI1357.00+/-34277智谱AIMIT
164DeepMindGemini 2.0 Flash ExperimentalDeepMind1356.00+/-94,067DeepMindProprietary
165MiniMaxAIMiniMax M2MiniMaxAI1356.00+/-33319MiniMaxAIApache 2.0
166ANling-flash-2.0Ant Group1355.00+/-27460Ant GroupMIT
167OpenAIGPT-4.1 miniOpenAI1354.00+/-112,694OpenAIProprietary
168Nvidianvidia-nemotron-3-nano-30b-a3b-bf16Nvidia1354.00+/-19987NvidiaNVIDIA Open Model
169阿里Qwen3-30B-A3B阿里巴巴1353.00+/-141,707阿里巴巴Apache 2.0
170AnthropicClaude 3.5 SonnetAnthropic1350.00+/-710,019AnthropicProprietary
171Mistralmistral-medium-2505Mistral1349.00+/-122,228MistralProprietary
172Tencenthunyuan-turbos-20250416Tencent1348.00+/-20845TencentProprietary
173OpenAIGPT-5-Nano (high)OpenAI1345.00+/-27494OpenAIProprietary
174AnthropicClaude 3.5 SonnetAnthropic1342.00+/-711,359AnthropicProprietary
175ANring-flash-2.0Ant Group1340.00+/-27454Ant GroupMIT
176MistralAIMistral-Small-3.2MistralAI1339.00+/-181,042MistralAIApache 2.0
177Google Deep MindGemini 1.5 ProGoogle Deep Mind1339.00+/-77,610Google Deep MindProprietary
178OpenAIGPT OSS 20BOpenAI1336.00+/-22680OpenAIApache 2.0
179亚马Nova 2 Lite亚马逊1335.00+/-20825亚马逊Proprietary
180DeepMindGemini 2.0 Flash-LiteDeepMind1326.00+/-102,814DeepMindProprietary
181Alibabaqwen-plus-0125Alibaba1324.00+/-19732AlibabaProprietary
182IBgranite-4.1-8bIBM1324.00+/-39229IBMApache 2.0
183Google Deep MindGemma 3 - 27B (IT)Google Deep Mind1322.00+/-93,581Google Deep MindGemma
184Metallama-3.1-405b-instruct-fp8Meta1319.00+/-88,482MetaLlama 3.1 Community
185FALlama 4 Maverick InstructFacebook AI研究实验室1319.00+/-112,839Facebook AI研究实验室Llama 4
186Google Deep MindGemma 3 - 12B (IT)Google Deep Mind1318.00+/-27389Google Deep MindGemma
187Metallama-3.1-405b-instruct-bf16Meta1315.00+/-85,215MetaLlama 3.1 Community
188StepFunstep-2-16k-exp-202412StepFun1313.00+/-20642StepFunProprietary
189NEathene-v2-chatNexusFlow1312.00+/-93,412NexusFlowNexusFlow
190AnthropicClaude3-OpusAnthropic1312.00+/-625,769AnthropicProprietary
191AIolmo-3-32b-thinkAi21311.00+/-32314Ai2Apache 2.0
192DeepSeek-AIDeepSeek-V3DeepSeek-AI1311.00+/-112,721DeepSeek-AIDeepSeek
193CohereAIC4AI Command A (202503)CohereAI1309.00+/-93,993CohereAICC-BY-NC-4.0
194FALlama 4 Scout InstructFacebook AI研究实验室1309.00+/-131,944Facebook AI研究实验室Llama
195OpenAIGPT-4oOpenAI1308.00+/-86,826OpenAIProprietary
196AIolmo-3.1-32b-instructAi21306.00+/-23697Ai2Apache 2.0
19701yi-lightning01 AI1306.00+/-103,92101 AIProprietary
198Googlegemini-advanced-0514Google1305.00+/-106,395GoogleProprietary
199OpenAIGPT-4oOpenAI1305.00+/-715,103OpenAIProprietary
200Alibabaqwen2.5-plus-1127Alibaba1304.00+/-141,404AlibabaProprietary
201OpenAIGPT-4OpenAI1303.00+/-813,306OpenAIProprietary
202Tencenthunyuan-turbos-20250226Tencent1301.00+/-31238TencentProprietary
203OpenAIGPT-4OpenAI1299.00+/-812,374OpenAIProprietary
204StepFunstep-1o-turbo-202506StepFun1299.00+/-24564StepFunProprietary
205ZHglm-4-plus-0111Zhipu1298.00+/-19721ZhipuProprietary
206Google Deep MindGemini 1.5 ProGoogle Deep Mind1297.00+/-810,492Google Deep MindProprietary
207AIolmo-3.1-32b-thinkAi21297.00+/-26473Ai2Apache 2.0
208阿里Qwen2.5-VL-72B-Instruct阿里巴巴1296.00+/-85,415阿里巴巴Qwen
209RURunway Gen-4 TurboRunway1296.00+/-813,217RunwayProprietary
210FALlama3.3-70B-InstructFacebook AI研究实验室1296.00+/-85,777Facebook AI研究实验室Llama-3.3
211xAIGrok 2xAI1294.00+/-78,950xAIProprietary
212Tencenthunyuan-large-2025-02-10Tencent1293.00+/-24497TencentProprietary
213DeepSeekdeepseek-v2.5-1210DeepSeek1293.00+/-171,031DeepSeekDeepSeek
214Alibabaqwen-max-0919Alibaba1291.00+/-122,249AlibabaQwen
215Tencenthunyuan-standard-2025-02-10Tencent1290.00+/-24499TencentProprietary
216Googlegemini-1.5-flash-002Google1288.00+/-94,789GoogleProprietary
217Mistralmistral-large-2407Mistral1288.00+/-86,664MistralMistral Research
218DeepSeek-AIDeepSeek V2.5DeepSeek-AI1288.00+/-103,649DeepSeek-AIDeepSeek
219ZHglm-4-plusZhipu AI1287.00+/-103,599Zhipu AIProprietary
220AnthropicClaude 3.5 HaikuAnthropic1286.00+/-76,364AnthropicProprietary
221MistralAIMagistral-Medium-2506MistralAI1285.00+/-26553MistralAIProprietary
222OpenAIGPT-4OpenAI1283.00+/-107,052OpenAIProprietary
223Mistralmistral-large-2411Mistral1282.00+/-93,574MistralMRL
224Tencenthunyuan-large-visionTencent1280.00+/-30351TencentProprietary
225Tencenthunyuan-turbo-0110Tencent1279.00+/-31243TencentProprietary
226IBibm-granite-h-smallIBM1279.00+/-32358IBMApache 2.0
227FALlama3.1-70B-InstructFacebook AI研究实验室1279.00+/-171,041Facebook AI研究实验室Llama 3.1
228MistralAIMistral-Small-3.1-24B-Instruct-2503MistralAI1278.00+/-132,131MistralAIApache 2.0
229OpenAIGPT-4o miniOpenAI1276.00+/-79,322OpenAIProprietary
230OpenAIGPT-4OpenAI1275.00+/-811,181OpenAIProprietary
231OpenAIGPT-4.1 nanoOpenAI1274.00+/-23582OpenAIProprietary
232阿里Qwen2-72B-Instruct阿里巴巴1273.00+/-94,835阿里巴巴Qianwen LICENSE
233xAIgrok-2-mini-2024-08-13xAI1273.00+/-87,261xAIProprietary
234DeepSeekdeepseek-coder-v2DeepSeek1271.00+/-131,858DeepSeekDeepSeek License
235Nvidiallama-3.1-nemotron-51b-instructNvidia1271.00+/-22507NvidiaLlama 3.1
236阿里Qwen2.5-Coder-32B-Instruct阿里巴巴1270.00+/-19725阿里巴巴Apache 2.0
237FALlama3.1-70B-InstructFacebook AI研究实验室1269.00+/-87,677Facebook AI研究实验室Llama 3.1 Community
238Amazonamazon-nova-pro-v1.0Amazon1269.00+/-102,978AmazonProprietary
239Microsoft AzurePhi 4 - 14BMicrosoft Azure1265.00+/-102,764Microsoft AzureMIT
240AIllama-3.1-tulu-3-70bAi21264.00+/-25397Ai2Llama 3.1
241MistralAIMistral Small 24B Instruct 2501MistralAI1261.00+/-131,683MistralAIApache 2.0
242NEathene-70b-0725NexusFlow1261.00+/-102,921NexusFlowCC-BY-NC-4.0
243Google Deep MindGemma-3n-E4BGoogle Deep Mind1260.00+/-151,572Google Deep MindGemma
244FALlama3-70B-InstructFacebook AI研究实验室1257.00+/-720,941Facebook AI研究实验室Llama 3 Community
245Googlegemini-1.5-flash-001Google1257.00+/-88,392GoogleProprietary
246Google Deep MindGemma 3 - 4B (IT)Google Deep Mind1254.00+/-28423Google Deep MindGemma
247AnthropicClaude3-SonnetAnthropic1253.00+/-813,766AnthropicProprietary
248Nvidianemotron-4-340b-instructNvidia1252.00+/-122,352NvidiaNVIDIA Open Model
249Tencenthunyuan-standard-256kTencent1250.00+/-29361TencentProprietary
250智谱GLM4智谱AI1247.00+/-161,191智谱AIProprietary
251REreka-core-20240904Reka AI1246.00+/-141,207Reka AIProprietary
252Googlegemma-2-27b-itGoogle1245.00+/-710,170GoogleGemma license
253AIjamba-1.5-largeAI21 Labs1245.00+/-151,147AI21 LabsJamba Open
254Amazonamazon-nova-lite-v1.0Amazon1244.00+/-112,511AmazonProprietary
255Mistralmistral-large-2402Mistral1244.00+/-97,987MistralProprietary
256CohereAIC4AI Aya Vision 32BCohereAI1232.00+/-103,854CohereAICC-BY-NC-4.0
257REreka-flash-20240904Reka AI1232.00+/-141,284Reka AIProprietary
258AnthropicClaude3-HaikuAnthropic1231.00+/-714,983AnthropicProprietary
259Coherecommand-r-plus-08-2024Cohere1231.00+/-141,467CohereCC-BY-NC-4.0
260Googlegemini-1.5-flash-8b-001Google1229.00+/-85,036GoogleProprietary
261MistralAIMixtral-8x22B-Instruct-v0.1MistralAI1228.00+/-96,778MistralAIApache 2.0
262AIolmo-2-0325-32b-instructAi21227.00+/-28375Ai2Apache-2.0
263Amazonamazon-nova-micro-v1.0Amazon1224.00+/-112,455AmazonProprietary
264阿里Qwen1.5-110B-Chat阿里巴巴1221.00+/-113,188阿里巴巴Qianwen LICENSE
265Mistralmistral-mediumMistral1220.00+/-114,406MistralProprietary
266Googlegemma-2-9b-itGoogle1218.00+/-87,110GoogleGemma license
267Microsoft AzurePhi-3-medium 14B-previewMicrosoft Azure1215.00+/-113,238Microsoft AzureMIT
268Mistralministral-8b-2410Mistral1213.00+/-20683MistralMRL
269CohereAIC4AI Command R+CohereAI1213.00+/-89,769CohereAICC-BY-NC-4.0
270零一Yi-1.5-34B零一万物1213.00+/-112,985零一万物Apache-2.0
271阿里QwQ-32B-Preview阿里巴巴1213.00+/-24480阿里巴巴Apache 2.0
272REreka-flash-21b-20240226-onlineReka AI1211.00+/-142,028Reka AIProprietary
273阿里Qwen1.5-72B-Chat阿里巴巴1208.00+/-105,327阿里巴巴Qianwen LICENSE
274上海InternLM2-Base-20B上海人工智能实验室1207.00+/-151,387上海人工智能实验室Other
275AIllama-3.1-tulu-3-8bAi21207.00+/-26363Ai2Llama 3.1
276Coherecommand-r-08-2024Cohere1206.00+/-141,601CohereCC-BY-NC-4.0
277PRgemma-2-9b-it-simpoPrinceton1205.00+/-151,285PrincetonMIT
278OpenAIgpt-3.5-turbo-1106OpenAI1203.00+/-152,134OpenAIProprietary
279Alibabaqwen1.5-32b-chatAlibaba1200.00+/-122,649AlibabaQianwen LICENSE
280CohereAIC4AI Aya Vision 8BCohereAI1200.00+/-151,307CohereAICC-BY-NC-4.0
281OpenAIgpt-3.5-turbo-0125OpenAI1200.00+/-88,626OpenAIProprietary
282DeepMindGemini-proDeepMind1199.00+/-19993DeepMindProprietary
283REreka-flash-21b-20240226Reka AI1199.00+/-113,363Reka AIProprietary
284IBgranite-3.1-2b-instructIBM1197.00+/-26391IBMApache 2.0
285IBgranite-3.0-8b-instructIBM1197.00+/-19873IBMApache 2.0
286HUzephyr-orpo-141b-A35b-v0.1HuggingFace1196.00+/-22589HuggingFaceApache 2.0
287DADBRX Instructdatabricks1196.00+/-114,001databricksDBRX LICENSE
288Googlegemini-pro-dev-apiGoogle1195.00+/-142,274GoogleProprietary
289Microsoft AzurePhi-3-mini 3.8BMicrosoft Azure1193.00+/-141,568Microsoft AzureMIT
290Microsoft AzurePhi-3-small 7BMicrosoft Azure1193.00+/-132,092Microsoft AzureMIT
291FALlama3-8B-InstructFacebook AI研究实验室1192.00+/-814,252Facebook AI研究实验室Llama 3 Community
292Mistralmixtral-8x7b-instruct-v0.1Mistral1191.00+/-99,663MistralApache 2.0
293FALlama3.1-8B-InstructFacebook AI研究实验室1190.00+/-28382Facebook AI研究实验室Apache 2.0
294FALlama3.1-8B-InstructFacebook AI研究实验室1189.00+/-87,135Facebook AI研究实验室Llama 3.1 Community
295AIjamba-1.5-miniAI21 Labs1186.00+/-161,094AI21 LabsJamba Open
296Coherecommand-rCohere1175.00+/-96,682CohereCC-BY-NC-4.0
297阿里Qwen3-VL-2B阿里巴巴1168.00+/-19908阿里巴巴Apache 2.0
298阿里Qwen1.5-14B-Chat阿里巴巴1167.00+/-142,184阿里巴巴Qianwen LICENSE
299Metallama-3.2-3b-instructMeta1165.00+/-161,136MetaLlama 3.2
300Googlegemma-2-2b-itGoogle1162.00+/-86,599GoogleGemma license
301SNsnowflake-arctic-instructSnowflake1162.00+/-114,793SnowflakeApache 2.0
302Google ResearchGemma 1.1-7B-ITGoogle Research1159.00+/-113,039Google ResearchGemma license
303NEstarling-lm-7b-betaNexusflow1158.00+/-141,973NexusflowApache-2.0
304OPopenchat-3.5-0106OpenChat1158.00+/-141,726OpenChatApache-2.0
305WIWizardLM-70B-V1.0WizardLM Team1158.00+/-19903WizardLM TeamLlama 2 Community
306DeepSeek-AIDeepSeek LLM 67B ChatDeepSeek-AI1155.00+/-23576DeepSeek-AIDeepSeek License
307HUsmollm2-1.7b-instructHuggingFace1152.00+/-33271HuggingFaceApache 2.0
308NOopenhermes-2.5-mistral-7bNousResearch1151.00+/-20697NousResearchApache-2.0
309零一Yi-34B零一万物1151.00+/-132,043零一万物Yi License
310Microsoft AzurePhi-3-mini 3.8BMicrosoft Azure1150.00+/-122,564Microsoft AzureMIT
311ALtulu-2-dpo-70bAllenAI/UW1145.00+/-19888AllenAI/UWAI2 ImpACT Low-risk
312Microsoft AzurePhi-3-mini 3.8BMicrosoft Azure1139.00+/-132,813Microsoft AzureMIT
313Metallama-2-70b-chatMeta1136.00+/-104,740MetaLlama 2 Community
314MistralAIMistral-7B-Instruct-v0.2MistralAI1127.00+/-122,605MistralAIApache-2.0
315UCstarling-lm-7b-alphaUC Berkeley1126.00+/-161,300UC BerkeleyCC-BY-NC-4.0
316阿里Qwen-14B-Chat阿里巴巴1125.00+/-24534阿里巴巴Qianwen LICENSE
317COdolphin-2.2.1-mistral-7bCognitive Computations1125.00+/-32219Cognitive ComputationsApache-2.0
318OPopenchat-3.5OpenChat1125.00+/-18945OpenChatApache-2.0
319Metallama-3.2-1b-instructMeta1124.00+/-161,162MetaLlama 3.2
320阿里Qwen1.5-7B-Chat阿里巴巴1120.00+/-20690阿里巴巴Qianwen LICENSE
321Google ResearchGemma 7B - ItGoogle Research1117.00+/-161,120Google ResearchGemma license
322LMVicuna 33BLM-SYS1115.00+/-132,663LM-SYSNon-commercial
323Google ResearchPaLM 2Google Research1115.00+/-19901Google ResearchProprietary
324Nvidiallama2-70b-steerlm-chatNvidia1114.00+/-27440NvidiaLlama 2 Community
325百川Baichuan2-13B-Chat百川智能1110.00+/-132,218百川智能Llama 2 Community
326UPsolar-10.7b-instruct-v1.0Upstage AI1109.00+/-22604Upstage AICC-BY-NC-4.0
327FACodeLLaMA-34BFacebook AI研究实验室1109.00+/-19770Facebook AI研究实验室Llama 2 Community
328Google ResearchGemma 1.1-2B-ITGoogle Research1107.00+/-161,355Google ResearchGemma license
329MOMPT-30B-ChatMosaicML1095.00+/-34242MosaicMLCC-BY-NC-SA-4.0
330NOnous-hermes-2-mixtral-8x7b-dpoNousResearch1093.00+/-21628NousResearchApache-2.0
331百川Baichuan2-7B-Chat百川智能1086.00+/-141,656百川智能Llama 2 Community
332阿里Qwen1.5-4B-Chat阿里巴巴1086.00+/-18988阿里巴巴Qianwen LICENSE
333TOstripedhyena-nous-7bTogether AI1084.00+/-20676Together AIApache 2.0
334LMVicuna 13BLM-SYS1082.00+/-142,146LM-SYSLlama 2 Community
335HUzephyr-7b-betaHuggingFace1082.00+/-171,250HuggingFaceMIT
336MistralAIMistral 7B InstructMistralAI1082.00+/-19974MistralAIApache 2.0
337UWguanaco-33bUW1080.00+/-32280UWNon-commercial
338Google ResearchGemma 2B - ItGoogle Research1070.00+/-22597Google ResearchGemma license
339Microsoftwizardlm-13bMicrosoft1064.00+/-21669MicrosoftLlama 2 Community
340AIolmo-7b-instructAi21054.00+/-19848Ai2Apache-2.0
341LMVicuna 7BLM-SYS1047.00+/-22658LM-SYSLlama 2 Community
342智谱ChatGLM3-6B智谱AI1042.00+/-23576智谱AIApache-2.0
343NOGPT4All 13BNomic AI998.00+/-37211Nomic AINon-commercial
344STalpaca-13bStanford991.00+/-23652StanfordNon-commercial
345MOMPT-7B-ChatMosaicML985.00+/-25471MosaicMLCC-BY-NC-SA-4.0
346RWRWKV-4-Raven-14BRWKV983.00+/-24544RWKVApache 2.0
347达摩Koala达摩院980.00+/-21751达摩院Non-commercial
348智谱ChatGLM-6B智谱AI976.00+/-26525智谱AINon-commercial
349智谱ChatGLM2-6B智谱AI971.00+/-35227智谱AIApache-2.0
350OPoasst-pythia-12bOpenAssistant959.00+/-22687OpenAssistantApache 2.0
351DAdolly-v2-12bDatabricks950.00+/-29370DatabricksMIT
352LMfastchat-t5-3bLMSYS919.00+/-26462LMSYSApache 2.0
353FALLaMA 13BFacebook AI研究实验室919.00+/-33252Facebook AI研究实验室Non-commercial
354STstablelm-tuned-alpha-7bStability AI890.00+/-29353Stability AICC-BY-NC-SA-4.0