DataLearner 标志DataLearnerAI
最新AI资讯
大模型排行榜
大模型评测基准
大模型列表
大模型对比
资源中心
工具
语言中文
DataLearner 标志DataLearner AI

专注大模型评测、数据资源与实践教学的知识平台,持续更新可落地的 AI 能力图谱。

产品

  • 评测榜单
  • 模型对比
  • 数据资源

资源

  • 部署教程
  • 原创内容
  • 工具导航

关于

  • 关于我们
  • 隐私政策
  • 数据收集方法
  • 联系我们

© 2026 DataLearner AI. DataLearner 持续整合行业数据与案例,为科研、企业与开发者提供可靠的大模型情报与实践指南。

隐私政策服务条款
首页综合排行榜Text Generation Arena 文本生成模型排行榜

LMArena 评测赛道

文本生成代码数学图像编辑文字生成视频图生视频文生图

Text Generation Arena 文本生成模型排行榜

基于 Text Generation Arena 用户匿名投票的最新AI文本生成模型排行榜,涵盖各模型的 Elo 得分、95% 置信区间、投票量、机构与许可证。

榜首模型

Claude Opus 4.6 (thinking)

最高得分

1,502

模型数量

360

数据版本

2026年05月17日

数据来源: LM Arena

关于本排行榜

本排行榜展示了当前最强 AI 大模型在文本生成任务中的综合实力排名。数据来源于 LMArena(前身为 LMSYS Chatbot Arena),这是目前全球最大的 AI 模型众包评测平台。用户在平台上与两个匿名模型同时对话,并投票选出更好的回答——排名完全由真实用户的偏好决定,而非实验室基准测试。

评测方法概要

匿名盲测:用户同时与两个"隐藏身份"的模型对话,根据回答质量投票,排除品牌偏见。

Elo 评分:基于国际象棋领域的 Elo Rating 体系(Bradley-Terry 模型),通过对战结果计算每个模型的实力分数。分数越高,说明模型在真实对话中被用户选中的概率越大。

场景覆盖广泛:涵盖编程、创意写作、数学推理、知识问答、角色扮演等高频真实场景。

DataLearner 在原始数据基础上提供中文解读与深度分析,并将排行榜模型关联至 DataLearner 模型库,方便您一键查看模型详情、API 定价、评测得分等完整信息。

来源:全部国产模型
榜单历史快照月份:

排名总表

排名模型名称得分95% CI投票数机构许可证
AnthropicClaude Opus 4.6 (thinking)Anthropic1,502+/-427,454AnthropicProprietary
AnthropicOpus 4.7 (thinking)Anthropic1,500+/-612,920AnthropicProprietary
AnthropicClaude Opus 4.6Anthropic1,498+/-429,240AnthropicProprietary
4AnthropicOpus 4.7Anthropic1,492+/-613,571AnthropicProprietary
5FAMuse SparkFacebook AI研究实验室1,489+/-611,103Facebook AI研究实验室Proprietary
6Google Deep MindGemini 3.1 Pro PreviewGoogle Deep Mind1,488+/-434,189Google Deep MindProprietary
7Google Deep MindGemini 3.0 Pro (Preview 11-2025)Google Deep Mind1,486+/-441,331Google Deep MindProprietary
8OpenAIGPT-5.5 (high)OpenAI1,481+/-610,172OpenAIProprietary
9Google Deep MindGemini 3.5 FlashGoogle Deep Mind1,480+/-85,907Google Deep MindProprietary
10OpenAIGPT-5.4 (high)OpenAI1,480+/-521,023OpenAIProprietary
11OpenAIGPT-5.5OpenAI1,478+/-610,294OpenAIProprietary
12xAIGrok 4.20 BetaxAI1,478+/-522,458xAIProprietary
13OpenAIGPT-5.2OpenAI1,477+/-427,988OpenAIProprietary
14阿里Qwen3.7-Max-Preview阿里巴巴1,475+/-103,741阿里巴巴Proprietary
15xAIGrok 4.20 Beta ReasoningxAI1,475+/-521,572xAIProprietary
16xAIGrok 4.20 Multi-AgentxAI1,474+/-521,565xAIProprietary
17Google Deep MindGemini 3.0 FlashGoogle Deep Mind1,473+/-430,742Google Deep MindProprietary
18百度ERNIE-5.1-Preview百度1,473+/-79,004百度Proprietary
19AnthropicClaude Opus 4 (thinking-32k)Anthropic1,473+/-437,130AnthropicProprietary
20智谱GLM 5.1智谱AI1,472+/-612,295智谱AIMIT
21OpenAIGPT-5.5 InstantOpenAI1,472+/-610,790OpenAIProprietary
22AnthropicClaude Sonnet 4.6Anthropic1,468+/-520,839AnthropicProprietary
23AnthropicClaude Opus 4Anthropic1,468+/-358,884AnthropicProprietary
24OpenAIGPT-5.4OpenAI1,467+/-522,146OpenAIProprietary
25xAIGrok 4.1 ThinkingxAI1,467+/-359,368xAIProprietary
26XIMiMo V2.5 ProXiaomi1,465+/-69,700XiaomiMIT
27AlibabaQwen3.5 Max PreviewAlibaba1,464+/-517,346AlibabaProprietary
28Google Deep MindGemini 3.0 Flash (minimal)Google Deep Mind1,463+/-445,395Google Deep MindProprietary
29Moonshot AIKimi K2.6Moonshot AI1,462+/-610,281Moonshot AIModified MIT
30DeepSeek-AIDeepSeek-V4-Pro (thinking)DeepSeek-AI1,461+/-69,970DeepSeek-AIMIT
31xAIGrok 4.1xAI1,460+/-363,263xAIProprietary
32DeepSeek-AIDeepSeek-V4-ProDeepSeek-AI1,459+/-610,729DeepSeek-AIMIT
33阿里Qwen3.6-Max-Preview阿里巴巴1,457+/-94,227阿里巴巴Proprietary
34智谱GLM-5智谱AI1,457+/-520,816智谱AIMIT
35BytedanceDOLA Seed 2.0 ProBytedance1,456+/-430,543BytedanceProprietary
36OpenAIGPT-5.1 Pro (high)OpenAI1,455+/-440,848OpenAIProprietary
37AnthropicClaude Sonnet 4.5 (thinking-32k)Anthropic1,454+/-371,013AnthropicProprietary
38AnthropicClaude Sonnet 4.5Anthropic1,454+/-369,196AnthropicProprietary
39OpenAIGPT-5.4 mini (high)OpenAI1,454+/-518,979OpenAIProprietary
40DeepMindGemma 4 31BDeepMind1,451+/-85,840DeepMindApache 2.0
41xAIGrok 4.3 BetaxAI1,451+/-69,082xAIProprietary
42百度ERNIE 5.0百度1,450+/-431,558百度Proprietary
43Moonshot AIKimi K2 ThinkingMoonshot AI1,449+/-430,661Moonshot AIModified MIT
44百度ERNIE 5.0百度1,449+/-79,754百度Proprietary
45OpenAIGPT-5.3OpenAI1,449+/-526,710OpenAIProprietary
46AnthropicOpus 4.1 (thinking-16k)Anthropic1,449+/-349,822AnthropicProprietary
47XIMiMo V2 ProXiaomi1,447+/-518,975XiaomiProprietary
48AnthropicOpus 4.1Anthropic1,447+/-377,378AnthropicProprietary
49Google Deep MindGemini 2.5 Pro Experimental 03-25Google Deep Mind1,446+/-3118,726Google Deep MindProprietary
50阿里Qwen3.5-397B-A17B阿里巴巴1,445+/-425,861阿里巴巴Apache 2.0
51OpenAIGPT-4.5OpenAI1,444+/-614,547OpenAIProprietary
52阿里Qwen 3.6 Plus Preview阿里巴巴1,444+/-612,075阿里巴巴Proprietary
53OpenAIGPT-4o(2025-03-27)OpenAI1,443+/-382,481OpenAIProprietary
54智谱GLM-4.7智谱AI1,443+/-612,134智谱AIMIT
55DeepSeek-AIDeepSeek-V4-Flash (thinking)DeepSeek-AI1,440+/-610,115DeepSeek-AIMIT
56OpenAIGPT-5.2 Pro (high)OpenAI1,439+/-442,076OpenAIProprietary
57OpenAIGPT-5.1 InstantOpenAI1,439+/-443,497OpenAIProprietary
58DeepMindGemma 4 26B A4BDeepMind1,438+/-85,782DeepMindApache 2.0
59Googlegemini-3.1-flash-lite-previewGoogle1,436+/-427,771GoogleProprietary
60OpenAIGPT-5.2OpenAI1,436+/-439,304OpenAIProprietary
61MeituanLongCat Flash Chat (2602)Meituan1,435+/-517,017MeituanProprietary
62阿里Qwen3 Max (Preview)阿里巴巴1,435+/-527,731阿里巴巴Proprietary
63OpenAIGPT-5-Pro (high)OpenAI1,434+/-531,951OpenAIProprietary
64DeepSeek-AIDeepSeek-V4-FlashDeepSeek-AI1,433+/-610,124DeepSeek-AIMIT
65MoonshotKimi K2.5 InstantMoonshot1,432+/-78,201MoonshotModified MIT
66xAIGrok 4.1 Fast (fast-reasoning)xAI1,431+/-352,836xAIProprietary
67OpenAIOpenAI o3OpenAI1,431+/-459,771OpenAIProprietary
68Moonshot AIKimi K2 Thinking (thinking-turbo)Moonshot AI1,430+/-356,632Moonshot AIModified MIT
69XIMiMo V2.5Xiaomi1,429+/-69,729XiaomiMIT
70Amazonamazon-nova-experimental-chat-26-02-10Amazon1,427+/-103,433AmazonProprietary
71OpenAIGPT-5OpenAI1,427+/-431,598OpenAIProprietary
72智谱GLM-4.6智谱AI1,426+/-435,672智谱AIMIT
73DeepSeek-AIDeepSeek V3.2-Exp (thinking)DeepSeek-AI1,425+/-79,069DeepSeek-AIMIT
74Alibabaqwen3-max-2025-09-23Alibaba1,424+/-69,166AlibabaProprietary
75AnthropicClaude Opus 4 (thinking-16k)Anthropic1,424+/-436,920AnthropicProprietary
76DeepSeek-AIDeepSeek V3.2DeepSeek-AI1,424+/-445,275DeepSeek-AIMIT
77阿里Qwen3-235B-A22B-2507阿里巴巴1,423+/-392,159阿里巴巴Apache 2.0
78DeepSeek-AIDeepSeek V3.2-ExpDeepSeek-AI1,423+/-611,936DeepSeek-AIMIT
79DeepSeek-AIDeepSeek-R1-0528DeepSeek-AI1,422+/-618,467DeepSeek-AIMIT
80DeepSeek-AIDeepSeek V3.2 (thinking)DeepSeek-AI1,422+/-439,392DeepSeek-AIMIT
81xAIGrok 4 FastxAI1,421+/-86,817xAIProprietary
82百度ERNIE 5.0百度1,419+/-94,711百度Proprietary
83Moonshot AIKimi K2 0905Moonshot AI1,418+/-611,795Moonshot AIModified MIT
84DeepSeek-AIDeepSeek-V3.1DeepSeek-AI1,418+/-614,974DeepSeek-AIMIT
85阿里Qwen3.5-122B-A10B阿里巴巴1,418+/-523,088阿里巴巴Apache 2.0
86Tencenthunyuan-hy3-previewTencent1,417+/-85,184Tencenttencent-hunyuan-community
87DeepSeek-AIDeepSeek-V3.1 Terminus (thinking)DeepSeek-AI1,417+/-103,471DeepSeek-AIMIT
88Moonshot AIKimi K2Moonshot AI1,417+/-527,644Moonshot AIModified MIT
89DeepSeek-AIDeepSeek-V3.1 (thinking)DeepSeek-AI1,417+/-711,753DeepSeek-AIMIT
90DeepSeek-AIDeepSeek-V3.1 TerminusDeepSeek-AI1,416+/-103,707DeepSeek-AIMIT
91Amazonamazon-nova-experimental-chat-26-01-10Amazon1,416+/-103,415AmazonProprietary
92阿里Qwen3-VL-235B-A22B-Instruct阿里巴巴1,415+/-611,518阿里巴巴Apache 2.0
93MistralAIMistral Large 3MistralAI1,415+/-441,713MistralAIApache 2.0
94OpenAIGPT-4.1OpenAI1,413+/-450,995OpenAIProprietary
95AnthropicClaude Opus 4Anthropic1,412+/-444,235AnthropicProprietary
96xAIGrok 3xAI1,412+/-432,914xAIProprietary
97智谱GLM-4.5智谱AI1,411+/-524,324智谱AIMIT
98Google Deep MindGemini 2.5 FlashGoogle Deep Mind1,411+/-3118,454Google Deep MindProprietary
99AnthropicHaiku 4.5Anthropic1,410+/-370,948AnthropicProprietary
100xAIGrok 4xAI1,410+/-441,407xAIProprietary
101MistralAIMagistral-Medium-2506MistralAI1,410+/-388,255MistralAIProprietary
102MiniMaxAIMiniMax-M2.7MiniMaxAI1,409+/-516,997MiniMaxAIModified MIT
103阿里Qwen3.5-27B阿里巴巴1,409+/-522,477阿里巴巴Apache 2.0
104OpenAIGPT-5.4 nano (high)OpenAI1,406+/-518,377OpenAIProprietary
105Google Deep MindGemini 2.5 Flash-Preview-09-2025Google Deep Mind1,405+/-432,923Google Deep MindProprietary
106xAIGrok 4 Fast (fast-reasoning)xAI1,404+/-518,720xAIProprietary
107Alibabaqwen3-235b-a22b-no-thinkingAlibaba1,403+/-538,234AlibabaApache 2.0
108阿里Qwen3-Next阿里巴巴1,402+/-522,869阿里巴巴Apache 2.0
109OpenAIOpenAI o1OpenAI1,402+/-427,807OpenAIProprietary
110MeituanLongCat Flash Chat (2602)Meituan1,401+/-611,402MeituanMIT
111Alibabaqwen3-235b-a22b-thinking-2507Alibaba1,399+/-78,999AlibabaApache 2.0
112AnthropicClaude Sonnet 4 (thinking-32k)Anthropic1,399+/-435,123AnthropicProprietary
113DeepSeek-AIDeepSeek-R1DeepSeek-AI1,398+/-518,524DeepSeek-AIMIT
114阿里Qwen3.5-35B-A3B阿里巴巴1,397+/-523,500阿里巴巴Apache 2.0
115Tencenthunyuan-vision-1.5-thinkingTencent1,396+/-122,219TencentProprietary
116StepFunAIStep 3.5 FlashStepFunAI1,396+/-523,305StepFunAIProprietary
117阿里Qwen3-VL-235B-A22B-Instruct (thinking)阿里巴巴1,396+/-77,941阿里巴巴Apache 2.0
118DeepSeek-AIDeepSeek-V3-0324DeepSeek-AI1,395+/-445,520DeepSeek-AIMIT
119StepFunAIStep 3.5 FlashStepFunAI1,395+/-428,558StepFunAIApache 2.0
120Amazonamazon-nova-experimental-chat-12-10Amazon1,395+/-103,683AmazonProprietary
121MiniMaxAIMiniMax M2.5MiniMaxAI1,394+/-428,912MiniMaxAIModified MIT
122XImimo-v2-flash (non-thinking)Xiaomi1,393+/-440,885XiaomiMIT
123Microsoft AIMAI Image 1Microsoft AI1,393+/-517,890Microsoft AIProprietary
124OpenAIGPT-5-mini (high)OpenAI1,390+/-527,045OpenAIProprietary
125OpenAIOpenAI o4 - miniOpenAI1,390+/-445,450OpenAIProprietary
126AnthropicClaude Sonnet 4Anthropic1,389+/-440,333AnthropicProprietary
127OpenAIOpenAI o1OpenAI1,388+/-531,122OpenAIProprietary
128腾讯Hunyuan-T1腾讯AI实验室1,387+/-94,714腾讯AI实验室Proprietary
129XImimo-v2-flash (thinking)Xiaomi1,387+/-610,973XiaomiMIT
130阿里Qwen3-Coder-480B-A35B阿里巴巴1,387+/-525,753阿里巴巴Apache 2.0
131AnthropicClaude Sonnet 3.7 (thinking-32k)Anthropic1,387+/-438,839AnthropicProprietary
132Mistralmistral-medium-2505Mistral1,387+/-533,243MistralProprietary
133MiniMaxAIM2.1MiniMaxAI1,385+/-517,156MiniMaxAIMIT
134阿里Qwen3-30B-A3B-2507阿里巴巴1,383+/-523,750阿里巴巴Apache 2.0
135OpenAIGPT-4.1 miniOpenAI1,382+/-439,354OpenAIProprietary
136Tencenthunyuan-turbos-20250416Tencent1,382+/-610,723TencentProprietary
137Google Deep MindGemini 2.5 Flash-Lite-Preview-09-2025 (no-thinking)Google Deep Mind1,380+/-347,249Google Deep MindProprietary
138智谱GLM-4.6V智谱AI1,378+/-112,806智谱AIMIT
139ARtrinity-large-previewArcee AI1,376+/-524,901Arcee AIApache 2.0
140阿里Qwen3-235B-A22B阿里巴巴1,375+/-526,278阿里巴巴Apache 2.0
141Google Deep MindGemini 2.5 Flash-Lite (thinking)Google Deep Mind1,375+/-532,934Google Deep MindProprietary
142阿里Qwen2.5-Max阿里巴巴1,374+/-432,624阿里巴巴Proprietary
143智谱GLM-4.5-Air智谱AI1,373+/-431,099智谱AIMIT
144ARtrinity-large-thinkingArcee AI1,373+/-516,436Arcee AIApache 2.0
145AnthropicClaude 3.5 SonnetAnthropic1,372+/-388,356AnthropicProprietary
146AnthropicClaude Sonnet 3.7Anthropic1,371+/-443,197AnthropicProprietary
147阿里Qwen3-Next (thinking)阿里巴巴1,369+/-613,706阿里巴巴Apache 2.0
148智谱GLM-4.7-Flash智谱AI1,368+/-611,750智谱AIMIT
149Amazonamazon-nova-experimental-chat-11-10Amazon1,367+/-425,416AmazonProprietary
150Google Deep MindGemma 3 - 27B (IT)Google Deep Mind1,366+/-447,559Google Deep MindGemma
151MiniMaxminimax-m1MiniMax1,364+/-435,221MiniMaxApache 2.0
152OpenAIOpenAI o3-mini (high)OpenAI1,363+/-518,589OpenAIProprietary
153OpenAIOpenAI o3-mini (high)OpenAI1,362+/-516,973OpenAIProprietary
154Nvidianvidia-nemotron-3-super-120b-a12bNvidia1,361+/-77,418NvidiaNVIDIA Open Model
155DeepMindGemini 2.0 Flash ExperimentalDeepMind1,360+/-443,765DeepMindProprietary
156DeepSeek-AIDeepSeek-V3DeepSeek-AI1,358+/-521,770DeepSeek-AIDeepSeek
157MistralAIMistral-Small-3.2MistralAI1,357+/-517,716MistralAIApache 2.0
158xAIgrok-3-mini-betaxAI1,357+/-522,729xAIProprietary
159PRintellect-3Prime Intellect1,357+/-85,328Prime IntellectMIT
160CohereAIC4AI Command A (202503)CohereAI1,354+/-356,294CohereAICC-BY-NC-4.0
161智谱GLM-4.5V智谱AI1,353+/-84,962智谱AIMIT
162DeepMindGemini 2.0 Flash-LiteDeepMind1,353+/-424,955DeepMindProprietary
163OpenAIGPT OSS 120BOpenAI1,353+/-430,646OpenAIApache 2.0
164Google Deep MindGemini 1.5 ProGoogle Deep Mind1,351+/-355,606Google Deep MindProprietary
165Amazonamazon-nova-experimental-chat-10-20Amazon1,350+/-611,470AmazonProprietary
166Tencenthunyuan-turbos-20250226Tencent1,349+/-122,220TencentProprietary
167StepFunAIStep3StepFunAI1,348+/-76,551StepFunAIApache 2.0
168Amazonamazon-nova-experimental-chat-10-09Amazon1,348+/-112,841AmazonProprietary
169OpenAIOpenAI o3-miniOpenAI1,347+/-457,349OpenAIProprietary
170Nvidiallama-3.1-nemotron-ultra-253b-v1Nvidia1,347+/-122,549NvidiaNvidia Open Model
171阿里Qwen3-32B阿里巴巴1,347+/-93,926阿里巴巴Apache 2.0
172INmercury-2Inception AI1,347+/-113,120Inception AIProprietary
173INling-flash-2.0InclusionAI1,346+/-77,010InclusionAIMIT
174MiniMaxAIMiniMax M2MiniMaxAI1,346+/-86,868MiniMaxAIApache 2.0
175Alibabaqwen-plus-0125Alibaba1,346+/-85,819AlibabaProprietary
176OpenAIGPT-4oOpenAI1,345+/-3112,881OpenAIProprietary
177Nvidianvidia-llama-3.3-nemotron-super-49b-v1.5Nvidia1,343+/-103,344NvidiaNvidia Open
178ZHglm-4-plus-0111Zhipu1,343+/-85,760ZhipuProprietary
179AnthropicClaude 3.5 SonnetAnthropic1,342+/-382,419AnthropicProprietary
180Google Deep MindGemma 3 - 12B (IT)Google Deep Mind1,342+/-103,829Google Deep MindGemma
181Tencenthunyuan-turbo-0110Tencent1,340+/-122,290TencentProprietary
182OpenAIGPT-5-Nano (high)OpenAI1,337+/-78,273OpenAIProprietary
183亚马Nova 2 Lite亚马逊1,337+/-612,242亚马逊Proprietary
184OpenAIOpenAI o1-miniOpenAI1,337+/-451,981OpenAIProprietary
185阿里QwQ-32B阿里巴巴1,336+/-425,403阿里巴巴Apache 2.0
186xAIGrok 2xAI1,335+/-463,498xAIProprietary
187Googlegemini-advanced-0514Google1,335+/-550,148GoogleProprietary
188OpenAIGPT-4oOpenAI1,335+/-445,499OpenAIProprietary
189Metallama-3.1-405b-instruct-bf16Meta1,334+/-441,375MetaLlama 3.1 Community
190StepFunstep-2-16k-exp-202412StepFun1,334+/-94,833StepFunProprietary
191Metallama-3.1-405b-instruct-fp8Meta1,333+/-459,656MetaLlama 3.1 Community
192AIolmo-3.1-32b-instructAi21,330+/-612,228Ai2Apache 2.0
19301yi-lightning01 AI1,328+/-527,33201 AIProprietary
194AImolmo-2-8bAi21,328+/-21805Ai2Apache 2.0
195Nvidiallama-3.3-nemotron-49b-super-v1Nvidia1,328+/-122,218NvidiaNvidia
196阿里Qwen3-30B-A3B阿里巴巴1,327+/-526,500阿里巴巴Apache 2.0
197FALlama 4 Maverick InstructFacebook AI研究实验室1,327+/-439,993Facebook AI研究实验室Llama 4
198Tencenthunyuan-large-2025-02-10Tencent1,326+/-103,738TencentProprietary
199OpenAIRunway Gen-4 TurboOpenAI1,324+/-498,114OpenAIProprietary
200DeepSeekdeepseek-v2.5-1210DeepSeek1,323+/-86,795DeepSeekDeepSeek
201Google Deep MindGemini 1.5 ProGoogle Deep Mind1,323+/-479,138Google Deep MindProprietary
202AnthropicClaude 3.5 HaikuAnthropic1,323+/-370,008AnthropicProprietary
203FALlama 4 Scout InstructFacebook AI研究实验室1,322+/-530,312Facebook AI研究实验室Llama
204OpenAIGPT-4.1 nanoOpenAI1,322+/-86,103OpenAIProprietary
205AnthropicClaude3-OpusAnthropic1,321+/-3194,909AnthropicProprietary
206INring-flash-2.0InclusionAI1,321+/-77,153InclusionAIMIT
207StepFunstep-1o-turbo-202506StepFun1,320+/-79,041StepFunProprietary
208ZHglm-4-plusZhipu AI1,319+/-526,126Zhipu AIProprietary
209Google Deep MindGemma-3n-E4BGoogle Deep Mind1,318+/-522,606Google Deep MindGemma
210FALlama3.3-70B-InstructFacebook AI研究实验室1,318+/-354,746Facebook AI研究实验室Llama-3.3
211Alibabaqwen-max-0919Alibaba1,318+/-616,478AlibabaQwen
212OpenAIGPT-4o miniOpenAI1,317+/-468,710OpenAIProprietary
213OpenAIGPT OSS 20BOpenAI1,317+/-610,633OpenAIApache 2.0
214Nvidianvidia-nemotron-3-nano-30b-a3b-bf16Nvidia1,317+/-615,517NvidiaNVIDIA Open Model
215Alibabaqwen2.5-plus-1127Alibaba1,315+/-610,187AlibabaProprietary
216NEathene-v2-chatNexusFlow1,314+/-524,739NexusFlowNexusFlow
217Mistralmistral-large-2407Mistral1,314+/-445,459MistralMistral Research
218OpenAIGPT-4OpenAI1,312+/-493,439OpenAIProprietary
219OpenAIGPT-4OpenAI1,312+/-4100,105OpenAIProprietary
220IBgranite-4.1-8bIBM1,311+/-113,240IBMApache 2.0
221Tencenthunyuan-standard-2025-02-10Tencent1,311+/-103,904TencentProprietary
222Googlegemini-1.5-flash-002Google1,309+/-434,902GoogleProprietary
223xAIgrok-2-mini-2024-08-13xAI1,308+/-452,567xAIProprietary
224DeepSeek-AIDeepSeek V2.5DeepSeek-AI1,307+/-524,572DeepSeek-AIDeepSeek
225INmercuryInception AI1,306+/-141,954Inception AIProprietary
226NEathene-70b-0725NexusFlow1,306+/-619,621NexusFlowCC-BY-NC-4.0
227AIolmo-3-32b-thinkAi21,305+/-85,953Ai2Apache 2.0
228Mistralmistral-large-2411Mistral1,305+/-428,073MistralMRL
229MistralAIMagistral-Medium-2506MistralAI1,304+/-611,643MistralAIProprietary
230Google Deep MindGemma 3 - 4B (IT)Google Deep Mind1,303+/-94,171Google Deep MindGemma
231MistralAIMistral-Small-3.1-24B-Instruct-2503MistralAI1,303+/-533,235MistralAIApache 2.0
232阿里Qwen2.5-VL-72B-Instruct阿里巴巴1,303+/-439,406阿里巴巴Qwen
233FALlama3.1-70B-InstructFacebook AI研究实验室1,299+/-87,140Facebook AI研究实验室Llama 3.1
234Tencenthunyuan-large-visionTencent1,294+/-95,371TencentProprietary
235FALlama3.1-70B-InstructFacebook AI研究实验室1,293+/-455,240Facebook AI研究实验室Llama 3.1 Community
236Amazonamazon-nova-pro-v1.0Amazon1,290+/-524,745AmazonProprietary
237AIjamba-1.5-largeAI21 Labs1,289+/-78,662AI21 LabsJamba Open
238Googlegemma-2-27b-itGoogle1,288+/-375,754GoogleGemma license
239REreka-core-20240904Reka AI1,287+/-77,312Reka AIProprietary
240IBibm-granite-h-smallIBM1,287+/-85,677IBMApache 2.0
241OpenAIGPT-4OpenAI1,286+/-554,173OpenAIProprietary
242AIllama-3.1-tulu-3-70bAi21,286+/-102,846Ai2Llama 3.1
243Googlegemini-1.5-flash-001Google1,286+/-462,833GoogleProprietary
244Nvidiallama-3.1-nemotron-51b-instructNvidia1,285+/-103,749NvidiaLlama 3.1
245AIolmo-3.1-32b-thinkAi21,285+/-78,505Ai2Apache 2.0
246AnthropicClaude3-SonnetAnthropic1,280+/-4109,284AnthropicProprietary
247PRgemma-2-9b-it-simpoPrinceton1,279+/-710,072PrincetonMIT
248Nvidianemotron-4-340b-instructNvidia1,276+/-519,659NvidiaNVIDIA Open Model
249Coherecommand-r-plus-08-2024Cohere1,276+/-79,866CohereCC-BY-NC-4.0
250FALlama3-70B-InstructFacebook AI研究实验室1,275+/-4156,876Facebook AI研究实验室Llama 3 Community
251OpenAIGPT-4OpenAI1,274+/-488,723OpenAIProprietary
252MistralAIMistral Small 24B Instruct 2501MistralAI1,274+/-614,681MistralAIApache 2.0
253智谱GLM4智谱AI1,273+/-79,788智谱AIProprietary
254REreka-flash-20240904Reka AI1,271+/-77,536Reka AIProprietary
255阿里Qwen2.5-Coder-32B-Instruct阿里巴巴1,270+/-85,432阿里巴巴Apache 2.0
256CohereAIC4AI Aya Vision 32BCohereAI1,267+/-527,124CohereAICC-BY-NC-4.0
257Googlegemma-2-9b-itGoogle1,266+/-454,611GoogleGemma license
258DeepSeekdeepseek-coder-v2DeepSeek1,264+/-615,147DeepSeekDeepSeek License
259阿里Qwen2-72B-Instruct阿里巴巴1,261+/-537,325阿里巴巴Qianwen LICENSE
260CohereAIC4AI Command R+CohereAI1,261+/-477,554CohereAICC-BY-NC-4.0
261AnthropicClaude3-HaikuAnthropic1,260+/-4117,701AnthropicProprietary
262Amazonamazon-nova-lite-v1.0Amazon1,260+/-519,372AmazonProprietary
263Googlegemini-1.5-flash-8b-001Google1,258+/-435,558GoogleProprietary
264Microsoft AzurePhi 4 - 14BMicrosoft Azure1,256+/-524,126Microsoft AzureMIT
265AIolmo-2-0325-32b-instructAi21,251+/-113,334Ai2Apache-2.0
266Coherecommand-r-08-2024Cohere1,249+/-710,140CohereCC-BY-NC-4.0
267Mistralmistral-large-2402Mistral1,241+/-562,436MistralProprietary
268Amazonamazon-nova-micro-v1.0Amazon1,240+/-519,364AmazonProprietary
269AIjamba-1.5-miniAI21 Labs1,239+/-78,858AI21 LabsJamba Open
270Mistralministral-8b-2410Mistral1,237+/-94,781MistralMRL
271Googlegemini-pro-dev-apiGoogle1,235+/-718,354GoogleProprietary
272阿里Qwen1.5-110B-Chat阿里巴巴1,233+/-626,195阿里巴巴Qianwen LICENSE
273Tencenthunyuan-standard-256kTencent1,233+/-122,728TencentProprietary
274REreka-flash-21b-20240226-onlineReka AI1,232+/-715,450Reka AIProprietary
275阿里Qwen1.5-72B-Chat阿里巴巴1,232+/-539,302阿里巴巴Qianwen LICENSE
276MistralAIMixtral-8x22B-Instruct-v0.1MistralAI1,228+/-551,416MistralAIApache 2.0
277Coherecommand-rCohere1,226+/-554,036CohereCC-BY-NC-4.0
278REreka-flash-21b-20240226Reka AI1,226+/-624,806Reka AIProprietary
279OpenAIgpt-3.5-turbo-0125OpenAI1,223+/-566,207OpenAIProprietary
280CohereAIC4AI Aya Vision 8BCohereAI1,223+/-79,818CohereAICC-BY-NC-4.0
281FALlama3-8B-InstructFacebook AI研究实验室1,223+/-4104,642Facebook AI研究实验室Llama 3 Community
282Mistralmistral-mediumMistral1,222+/-534,550MistralProprietary
283DeepMindGemini-proDeepMind1,221+/-126,390DeepMindProprietary
284AIllama-3.1-tulu-3-8bAi21,220+/-112,896Ai2Llama 3.1
285零一Yi-1.5-34B零一万物1,212+/-524,146零一万物Apache-2.0
286HUzephyr-orpo-141b-A35b-v0.1HuggingFace1,212+/-114,652HuggingFaceApache 2.0
287FALlama3.1-8B-InstructFacebook AI研究实验室1,211+/-449,605Facebook AI研究实验室Llama 3.1 Community
288FALlama3.1-8B-InstructFacebook AI研究实验室1,207+/-113,090Facebook AI研究实验室Apache 2.0
289Alibabaqwen1.5-32b-chatAlibaba1,203+/-621,741AlibabaQianwen LICENSE
290OpenAIgpt-3.5-turbo-1106OpenAI1,202+/-916,619OpenAIProprietary
291Googlegemma-2-2b-itGoogle1,199+/-446,616GoogleGemma license
292Microsoft AzurePhi-3-medium 14B-previewMicrosoft Azure1,197+/-525,055Microsoft AzureMIT
293Mistralmixtral-8x7b-instruct-v0.1Mistral1,196+/-473,503MistralApache 2.0
294DADBRX Instructdatabricks1,194+/-632,191databricksDBRX LICENSE
295上海InternLM2-Base-20B上海人工智能实验室1,190+/-79,901上海人工智能实验室Other
296阿里Qwen1.5-14B-Chat阿里巴巴1,190+/-717,839阿里巴巴Qianwen LICENSE
297WIWizardLM-70B-V1.0WizardLM Team1,184+/-98,214WizardLM TeamLlama 2 Community
298DeepSeek-AIDeepSeek LLM 67B ChatDeepSeek-AI1,183+/-124,932DeepSeek-AIDeepSeek License
299零一Yi-34B零一万物1,183+/-715,483零一万物Yi License
300IBgranite-3.0-8b-instructIBM1,181+/-96,638IBMApache 2.0
301OPopenchat-3.5OpenChat1,181+/-107,968OpenChatApache-2.0
302OPopenchat-3.5-0106OpenChat1,181+/-812,637OpenChatApache-2.0
303Google ResearchGemma 1.1-7B-ITGoogle Research1,180+/-623,893Google ResearchGemma license
304SNsnowflake-arctic-instructSnowflake1,178+/-632,832SnowflakeApache 2.0
305IBgranite-3.1-2b-instructIBM1,178+/-113,188IBMApache 2.0
306ALtulu-2-dpo-70bAllenAI/UW1,177+/-106,535AllenAI/UWAI2 ImpACT Low-risk
307NOopenhermes-2.5-mistral-7bNousResearch1,174+/-105,006NousResearchApache-2.0
308LMVicuna 33BLM-SYS1,172+/-622,479LM-SYSNon-commercial
309NEstarling-lm-7b-betaNexusflow1,171+/-716,056NexusflowApache-2.0
310Microsoft AzurePhi-3-small 7BMicrosoft Azure1,170+/-617,766Microsoft AzureMIT
311Metallama-2-70b-chatMeta1,170+/-638,492MetaLlama 2 Community
312UCstarling-lm-7b-alphaUC Berkeley1,166+/-810,224UC BerkeleyCC-BY-NC-4.0
313Metallama-3.2-3b-instructMeta1,166+/-87,936MetaLlama 3.2
314NOnous-hermes-2-mixtral-8x7b-dpoNousResearch1,164+/-123,777NousResearchApache-2.0
315阿里QwQ-32B-Preview阿里巴巴1,155+/-113,231阿里巴巴Apache 2.0
316阿里Qwen3-VL-2B阿里巴巴1,155+/-86,837阿里巴巴Apache 2.0
317Nvidiallama2-70b-steerlm-chatNvidia1,154+/-133,585NvidiaLlama 2 Community
318UPsolar-10.7b-instruct-v1.0Upstage AI1,151+/-134,155Upstage AICC-BY-NC-4.0
319COdolphin-2.2.1-mistral-7bCognitive Computations1,151+/-151,679Cognitive ComputationsApache-2.0
320MOMPT-30B-ChatMosaicML1,149+/-122,572MosaicMLCC-BY-NC-SA-4.0
321MistralAIMistral-7B-Instruct-v0.2MistralAI1,148+/-719,402MistralAIApache-2.0
322Microsoftwizardlm-13bMicrosoft1,148+/-97,044MicrosoftLlama 2 Community
323TIfalcon-180b-chatTII1,146+/-171,295TIIFalcon-180B TII License
324阿里Qwen1.5-7B-Chat阿里巴巴1,143+/-104,737阿里巴巴Qianwen LICENSE
325Microsoft AzurePhi-3-mini 3.8BMicrosoft Azure1,142+/-612,297Microsoft AzureMIT
326百川Baichuan2-13B-Chat百川智能1,140+/-719,174百川智能Llama 2 Community
327LMVicuna 13BLM-SYS1,140+/-719,367LM-SYSLlama 2 Community
328阿里Qwen-14B-Chat阿里巴巴1,137+/-114,964阿里巴巴Qianwen LICENSE
329Google ResearchPaLM 2Google Research1,137+/-98,554Google ResearchProprietary
330Google ResearchGemma 7B - ItGoogle Research1,136+/-98,925Google ResearchGemma license
331FACodeLLaMA-34BFacebook AI研究实验室1,135+/-97,366Facebook AI研究实验室Llama 2 Community
332HUzephyr-7b-betaHuggingFace1,130+/-911,118HuggingFaceMIT
333Microsoft AzurePhi-3-mini 3.8BMicrosoft Azure1,128+/-720,685Microsoft AzureMIT
334Microsoft AzurePhi-3-mini 3.8BMicrosoft Azure1,127+/-620,118Microsoft AzureMIT
335UWguanaco-33bUW1,126+/-122,921UWNon-commercial
336HUzephyr-7b-alphaHuggingFace1,126+/-161,785HuggingFaceMIT
337TOstripedhyena-nous-7bTogether AI1,120+/-115,182Together AIApache 2.0
338FACodeLlama-70B-InstructFacebook AI研究实验室1,118+/-181,143Facebook AI研究实验室Llama 2 Community
339Google ResearchGemma 1.1-2B-ITGoogle Research1,114+/-810,854Google ResearchGemma license
340LMVicuna 7BLM-SYS1,114+/-96,923LM-SYSLlama 2 Community
341HUsmollm2-1.7b-instructHuggingFace1,113+/-142,199HuggingFaceApache 2.0
342Metallama-3.2-1b-instructMeta1,110+/-88,045MetaLlama 3.2
343MistralAIMistral 7B InstructMistralAI1,109+/-98,977MistralAIApache 2.0
344百川Baichuan2-7B-Chat百川智能1,107+/-714,148百川智能Llama 2 Community
345Google ResearchGemma 2B - ItGoogle Research1,092+/-124,780Google ResearchGemma license
346阿里Qwen1.5-4B-Chat阿里巴巴1,089+/-97,597阿里巴巴Qianwen LICENSE
347AIolmo-7b-instructAi21,073+/-116,328Ai2Apache-2.0
348达摩Koala达摩院1,069+/-106,965达摩院Non-commercial
349STalpaca-13bStanford1,067+/-115,745StanfordNon-commercial
350NOGPT4All 13BNomic AI1,065+/-151,743Nomic AINon-commercial
351MOMPT-7B-ChatMosaicML1,061+/-123,924MosaicMLCC-BY-NC-SA-4.0
352智谱ChatGLM3-6B智谱AI1,055+/-124,658智谱AIApache-2.0
353RWRWKV-4-Raven-14BRWKV1,040+/-114,845RWKVApache 2.0
354智谱ChatGLM2-6B智谱AI1,023+/-142,658智谱AIApache-2.0
355OPoasst-pythia-12bOpenAssistant1,021+/-116,310OpenAssistantApache 2.0
356智谱ChatGLM-6B智谱AI994+/-134,914智谱AINon-commercial
357LMfastchat-t5-3bLMSYS990+/-124,203LMSYSApache 2.0
358DAdolly-v2-12bDatabricks979+/-143,412DatabricksMIT
359FALLaMA 13BFacebook AI研究实验室972+/-162,391Facebook AI研究实验室Non-commercial
360STstablelm-tuned-alpha-7bStability AI952+/-133,287Stability AICC-BY-NC-SA-4.0

数据仅供参考,以官方来源为准。模型名称旁的链接可跳转到 DataLearner 模型详情页。

常见问题 (FAQ)

01

什么是 Text Generation Arena (LMArena)?

Text Generation Arena(原 LMSYS Chatbot Arena)是目前最具影响力的大模型匿名评测平台。用户向两个身份未知的模型提问,根据回答质量投票,系统通过 Elo 算法将数百万次投票汇聚为动态排行榜,被学术界和工业界广泛引用。

02

Arena Elo 分数是如何计算的?

Elo 算法源自国际象棋评分体系。每次对战后,胜者得分上升、败者下降,幅度取决于双方原始评分差距。95% 置信区间(CI)反映该模型参与对战次数的多少:CI 越窄说明数据越充分、排名越可信。

03

为什么同一模型会出现"Thinking"和普通两个版本?

部分模型支持"扩展思考"(Extended Thinking)模式,会在给出最终答案前进行更深入的内部推理。该模式通常在逻辑推理、数学和编程任务上得分更高,但响应时延也更长、成本更高。Arena 将两种模式分开评测,以便用户根据实际需求选择。

04

如何根据排行榜选择适合自己的大语言模型?

建议综合考虑:综合性能(看 Elo 总分)、成本(闭源 API 按量计费,开源可自部署)、中文支持、开源程度以及响应速度。