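Text Generation Arena: Text Generation Model Leaderboard

The latest leaderboard of AI text generation models, based on anonymous user votes in Text Generation Arena. It lists each model's Elo score, 95% confidence interval, vote count, organization, and license.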

Top model: Claude Fable 5

Highest score: 1,509

Number of models: 369

Data version: July 1, 2026

Data source: LMArena

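About This Leaderboard

This leaderboard ranks today's leading large AI models by their overall strength on text generation tasks. The data comes from LMArena (formerly LMSYS Chatbot Arena), currently the world's largest crowdsourced evaluation platform for AI models. Users chat with two anonymous models at the same time and vote for the better answer, so the ranking is determined entirely by real user preferences rather than by laboratory benchmarks.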

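Evaluation Method Overview

Anonymous blind testing: Users chat with two models whose identities are hidden and vote on answer quality, which removes brand bias.

Elo rating: Based on the Elo rating system from chess (Bradley-Terry model), each model's strength score is computed from battle outcomes. The higher the score, the more likely the model is to be chosen by users in real conversations.

Broad scenario coverage: Covers frequent real-world scenarios such as programming, creative writing, mathematical reasoning, knowledge Q&A, and role-play.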
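To make the link between a score gap and a preference probability concrete, here is a minimal sketch of the logistic curve shared by Elo and Bradley-Terry models. The function name `win_probability`, the base of 10, and the scale of 400 are assumptions borrowed from the chess convention; this page does not state the exact constants LMArena uses.

```python
def win_probability(score_a: float, score_b: float, scale: float = 400.0) -> float:
    """Estimated probability that model A is preferred over model B.

    Assumes the chess-style logistic curve (base 10, scale 400);
    the constants are illustrative, not taken from the leaderboard.
    """
    return 1.0 / (1.0 + 10.0 ** ((score_b - score_a) / scale))


# Example with the two highest scores on this leaderboard (1,509 vs 1,504).
p_top = win_probability(1509, 1504)
print(f"{p_top:.3f}")  # a 5-point gap is close to a coin flip, about 0.507
```

Under these assumptions, the gaps of a few points between neighboring models near the top of the table translate into preference probabilities only slightly above 50%, which is one reason the confidence interval column matters when comparing adjacent ranks.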

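DataLearner adds Chinese-language commentary and in-depth analysis on top of the raw data and links each leaderboard model to the DataLearner model library, where you can view full model details, API pricing, and benchmark scores.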


Full Ranking Table

Rank | Model | Score | 95% CI | Votes | Organization | License
1 | Claude Fable 5 | 1,509 | ±9 | 4,350 | Anthropic | Proprietary
2 | Claude Opus 4.6 (thinking) | 1,504 | ±4 | 55,102 | Anthropic | Proprietary
3 | Opus 4.7 (thinking) | 1,502 | ±4 | 41,868 | Anthropic | Proprietary
4 | Claude Opus 4.6 | 1,499 | ±4 | 58,565 | Anthropic | Proprietary
5 | Opus 4.7 | 1,494 | ±4 | 43,053 | Anthropic | Proprietary
6 | Muse Spark | 1,487 | ±6 | 13,591 | Facebook AI Research | Proprietary
7 | Gemini 3.1 Pro Preview | 1,486 | ±4 | 73,099 | Google Deep Mind | Proprietary
8 | Gemini 3.0 Pro (Preview 11-2025) | 1,486 | ±4 | 41,306 | Google Deep Mind | Proprietary
9 | Claude Opus 4.8 (thinking) | 1,484 | ±5 | 22,340 | Anthropic | Proprietary
10 | GPT-5.5 (high) | 1,481 | ±5 | 37,260 | OpenAI | Proprietary
11 | Gemini 3.5 Flash | 1,479 | ±6 | 15,261 | Google Deep Mind | Proprietary
12 | GPT-5.4 (high) | 1,478 | ±4 | 50,378 | OpenAI | Proprietary
13 | Claude Opus 4.8 | 1,477 | ±6 | 22,687 | Anthropic | Proprietary
14 | GPT-5.2 | 1,476 | ±4 | 34,518 | OpenAI | Proprietary
15 | Qwen3.7-Max-Preview | 1,475 | ±10 | 3,727 | Alibaba | Proprietary
16 | GPT-5.5 | 1,475 | ±5 | 38,470 | OpenAI | Proprietary
17 | grok-4.20-beta-0309-reasoning | 1,475 | ±4 | 51,724 | xAI | Proprietary
18 | Grok 4.20 Beta | 1,474 | ±5 | 26,920 | xAI | Proprietary
19 | Gemini 3.0 Flash | 1,473 | ±4 | 30,711 | Google Deep Mind | Proprietary
20 | Claude Opus 4 (thinking-32k) | 1,473 | ±4 | 37,085 | Anthropic | Proprietary
21 | GPT-5.5 Instant | 1,473 | ±5 | 26,172 | OpenAI | Proprietary
22 | GLM 5.1 | 1,472 | ±5 | 22,689 | Zhipu AI | MIT
23 | Claude Sonnet 4.6 | 1,472 | ±4 | 48,706 | Anthropic | Proprietary
24 | grok-4.20-multi-agent-beta-0309 | 1,470 | ±4 | 50,647 | xAI | Proprietary
25 | Claude Opus 4 | 1,469 | ±3 | 71,135 | Anthropic | Proprietary
26 | GLM-5.2 (max) | 1,469 | ±7 | 9,338 | Zhipu AI | MIT
27 | ERNIE-5.1-Preview | 1,468 | ±5 | 32,742 | Baidu | Proprietary
28 | GPT-5.4 | 1,467 | ±4 | 53,056 | OpenAI | Proprietary
29 | mimo-v2.5-pro | 1,466 | ±5 | 34,468 | Xiaomi | MIT
30 | Grok 4.1 Thinking | 1,466 | ±3 | 65,565 | xAI | Proprietary
31 | Qwen3.5 Max Preview | 1,465 | ±5 | 21,531 | Alibaba | Proprietary
32 | claude-sonnet-5-thinking | 1,464 | ±9 | 4,385 | Anthropic | Proprietary
33 | qwen3.7-plus | 1,463 | ±6 | 14,900 | Alibaba | Proprietary
34 | Kimi K2.6 | 1,461 | ±5 | 32,657 | Moonshot AI | Modified MIT
35 | Qwen3.6-Max-Preview | 1,460 | ±8 | 5,215 | Alibaba | Proprietary
36 | Gemini 3.0 Flash (minimal) | 1,460 | ±3 | 75,608 | Google Deep Mind | Proprietary
37 | Grok 4.1 | 1,459 | ±3 | 67,704 | xAI | Proprietary
38 | DeepSeek-V4-Pro | 1,457 | ±5 | 37,116 | DeepSeek-AI | MIT
39 | GLM-5 | 1,457 | ±4 | 27,897 | Zhipu AI | MIT
40 | DeepSeek-V4-Pro (thinking) | 1,457 | ±5 | 35,108 | DeepSeek-AI | MIT
41 | Claude Sonnet 4.5 (thinking-32k) | 1,455 | ±3 | 82,456 | Anthropic | Proprietary
42 | Claude Sonnet 4.5 | 1,455 | ±3 | 80,895 | Anthropic | Proprietary
43 | DOLA Seed 2.0 Pro | 1,455 | ±4 | 59,361 | ByteDance Seed | Proprietary
44 | GPT-5.1 Pro (high) | 1,455 | ±4 | 40,823 | OpenAI | Proprietary
45 | Gemma 4 31B | 1,451 | ±8 | 5,898 | DeepMind | Apache 2.0
46 | Kimi K2 Thinking | 1,450 | ±4 | 55,504 | Moonshot AI | Modified MIT
47 | GPT-5.4 mini (high) | 1,449 | ±4 | 48,963 | OpenAI | Proprietary
48 | Opus 4.1 (thinking-16k) | 1,449 | ±3 | 49,773 | Anthropic | Proprietary
49 | ERNIE 5.0 | 1,449 | ±7 | 9,746 | Baidu | Proprietary
50 | GPT-5.3 | 1,449 | ±4 | 33,084 | OpenAI | Proprietary
51 | mimo-v2-pro | 1,448 | ±5 | 24,557 | Xiaomi | Proprietary
52 | Opus 4.1 | 1,447 | ±3 | 77,292 | Anthropic | Proprietary
53 | ERNIE 5.0 | 1,447 | ±4 | 35,291 | Baidu | Proprietary
54 | minimax-m3 | 1,447 | ±6 | 19,761 | MiniMax | MiniMax Community License
55 | Gemini 2.5 Pro Experimental 03-25 | 1,446 | ±3 | 124,527 | Google Deep Mind | Proprietary
56 | GPT-4.5 | 1,445 | ±6 | 14,547 | OpenAI | Proprietary
57 | Grok 4.3 Beta | 1,444 | ±5 | 37,636 | xAI | Proprietary
58 | Qwen 3.6 Plus Preview | 1,443 | ±5 | 36,564 | Alibaba | Proprietary
59 | GPT-4o(2025-03-27) | 1,443 | ±3 | 82,424 | OpenAI | Proprietary
60 | Qwen3.5-397B-A17B | 1,443 | ±4 | 50,832 | Alibaba | Apache 2.0
61 | GLM-4.7 | 1,442 | ±6 | 12,105 | Zhipu AI | MIT
62 | GPT-5.1 Instant | 1,439 | ±4 | 43,455 | OpenAI | Proprietary
63 | Gemma 4 26B A4B | 1,438 | ±8 | 5,812 | DeepMind | Apache 2.0
64 | GPT-5.2 Pro (high) | 1,437 | ±4 | 48,052 | OpenAI | Proprietary
65 | DeepSeek-V4-Flash (thinking) | 1,437 | ±5 | 36,653 | DeepSeek-AI | MIT
66 | DeepSeek-V4-Flash | 1,436 | ±5 | 36,817 | DeepSeek-AI | MIT
67 | longcat-flash-chat-2602-exp | 1,436 | ±5 | 28,162 | Meituan | Proprietary
68 | GPT-5.2 | 1,435 | ±3 | 68,870 | OpenAI | Proprietary
69 | Qwen3 Max (Preview) | 1,435 | ±5 | 27,707 | Alibaba | Proprietary
70 | GPT-5-Pro (high) | 1,434 | ±5 | 31,915 | OpenAI | Proprietary
71 | mimo-v2.5 | 1,433 | ±5 | 35,255 | Xiaomi | MIT
72 | gemini-3.1-flash-lite-preview | 1,433 | ±4 | 57,809 | Google | Proprietary
73 | Kimi K2.5 Instant | 1,432 | ±7 | 8,185 | Moonshot AI | Modified MIT
74 | Grok 4.1 Fast (fast-reasoning) | 1,431 | ±3 | 56,849 | xAI | Proprietary
75 | OpenAI o3 | 1,431 | ±4 | 59,721 | OpenAI | Proprietary
76 | mimo-v2-omni | 1,430 | ±6 | 19,629 | Xiaomi | Proprietary
77 | Kimi K2 Thinking (thinking-turbo) | 1,430 | ±3 | 62,069 | Moonshot AI | Modified MIT
78 | amazon-nova-experimental-chat-26-02-10 | 1,427 | ±10 | 3,421 | Amazon | Proprietary
79 | GPT-5 | 1,427 | ±4 | 31,551 | OpenAI | Proprietary
80 | mistral-medium-3.5 | 1,427 | ±7 | 10,782 | Mistral | Modified MIT
81 | GLM-4.6 | 1,425 | ±4 | 35,637 | Zhipu AI | MIT
82 | DeepSeek V3.2 | 1,425 | ±4 | 47,254 | DeepSeek-AI | MIT
83 | DeepSeek V3.2-Exp (thinking) | 1,425 | ±7 | 9,071 | DeepSeek-AI | MIT
84 | Claude Opus 4 (thinking-16k) | 1,424 | ±4 | 36,870 | Anthropic | Proprietary
85 | qwen3-max-2025-09-23 | 1,424 | ±6 | 9,151 | Alibaba | Proprietary
86 | Qwen3-235B-A22B-2507 | 1,423 | ±3 | 97,195 | Alibaba | Apache 2.0
87 | DeepSeek V3.2 (thinking) | 1,423 | ±4 | 41,062 | DeepSeek-AI | MIT
88 | DeepSeek V3.2-Exp | 1,423 | ±6 | 11,921 | DeepSeek-AI | MIT
89 | DeepSeek-R1-0528 | 1,422 | ±6 | 18,457 | DeepSeek-AI | MIT
90 | Grok 4 Fast | 1,421 | ±8 | 6,808 | xAI | Proprietary
91 | nvidia-nemotron-3-ultra-550b-a55b-nvfp4 | 1,420 | ±7 | 8,417 | Nvidia | OpenMDW-1.1
92 | ERNIE 5.0 | 1,419 | ±9 | 4,705 | Baidu | Proprietary
93 | DeepSeek-V3.1 Terminus (thinking) | 1,418 | ±10 | 3,459 | DeepSeek-AI | MIT
94 | Kimi K2 0905 | 1,418 | ±6 | 11,772 | Moonshot AI | Modified MIT
95 | Kimi K2 | 1,417 | ±5 | 27,627 | Moonshot AI | Modified MIT
96 | DeepSeek-V3.1 | 1,417 | ±6 | 14,954 | DeepSeek-AI | MIT
97 | DeepSeek-V3.1 (thinking) | 1,417 | ±7 | 11,731 | DeepSeek-AI | MIT
98 | Qwen3.5-122B-A10B | 1,417 | ±4 | 28,563 | Alibaba | Apache 2.0
99 | MiniMax-M2.7 | 1,417 | ±4 | 42,368 | MiniMaxAI | Modified MIT
100 | amazon-nova-experimental-chat-26-01-10 | 1,416 | ±10 | 3,410 | Amazon | Proprietary
101 | DeepSeek-V3.1 Terminus | 1,416 | ±10 | 3,691 | DeepSeek-AI | MIT
102 | Mistral Large 3 | 1,416 | ±4 | 44,086 | MistralAI | Apache 2.0
103 | Qwen3-VL-235B-A22B-Instruct | 1,415 | ±6 | 11,501 | Alibaba | Apache 2.0
104 | GPT-4.1 | 1,414 | ±4 | 50,955 | OpenAI | Proprietary
105 | hunyuan-hy3-preview | 1,413 | ±8 | 6,674 | Tencent | tencent-hunyuan-community
106 | Claude Opus 4 | 1,412 | ±4 | 44,191 | Anthropic | Proprietary
107 | Grok 3 | 1,412 | ±4 | 32,900 | xAI | Proprietary
108 | Haiku 4.5 | 1,411 | ±3 | 100,212 | Anthropic | Proprietary
109 | GLM-4.5 | 1,411 | ±5 | 24,293 | Zhipu AI | MIT
110 | Gemini 2.5 Flash | 1,410 | ±2 | 124,477 | Google Deep Mind | Proprietary
111 | Grok 4 | 1,410 | ±4 | 41,363 | xAI | Proprietary
112 | Magistral-Medium-2506 | 1,410 | ±3 | 93,943 | MistralAI | Proprietary
113 | Qwen3.5-27B | 1,409 | ±4 | 27,408 | Alibaba | Apache 2.0
114 | Gemini 2.5 Flash-Preview-09-2025 | 1,404 | ±4 | 32,884 | Google Deep Mind | Proprietary
115 | Grok 4 Fast (fast-reasoning) | 1,404 | ±5 | 18,708 | xAI | Proprietary
116 | GPT-5.4 nano (high) | 1,403 | ±4 | 48,005 | OpenAI | Proprietary
117 | qwen3-235b-a22b-no-thinking | 1,403 | ±5 | 38,191 | Alibaba | Apache 2.0
118 | OpenAI o1 | 1,402 | ±4 | 27,807 | OpenAI | Proprietary
119 | Qwen3-Next | 1,401 | ±5 | 22,865 | Alibaba | Apache 2.0
120 | longcat-flash-chat | 1,401 | ±6 | 11,380 | Meituan | MIT
121 | qwen3-235b-a22b-thinking-2507 | 1,399 | ±7 | 8,991 | Alibaba | Apache 2.0
122 | Claude Sonnet 4 (thinking-32k) | 1,399 | ±4 | 35,086 | Anthropic | Proprietary
123 | DeepSeek-R1 | 1,398 | ±5 | 18,524 | DeepSeek-AI | MIT
124 | Step 3.5 Flash | 1,398 | ±4 | 48,877 | StepFunAI | Proprietary
125 | Qwen3-VL-235B-A22B-Instruct (thinking) | 1,396 | ±7 | 7,939 | Alibaba | Apache 2.0
126 | Qwen3.5-35B-A3B | 1,396 | ±4 | 29,211 | Alibaba | Apache 2.0
127 | DeepSeek-V3-0324 | 1,396 | ±4 | 45,491 | DeepSeek-AI | MIT
128 | hunyuan-vision-1.5-thinking | 1,395 | ±12 | 2,218 | Tencent | Proprietary
129 | Step 3.5 Flash | 1,395 | ±4 | 52,274 | StepFunAI | Apache 2.0
130 | amazon-nova-experimental-chat-12-10 | 1,395 | ±10 | 3,676 | Amazon | Proprietary
131 | mimo-v2-flash (non-thinking) | 1,393 | ±4 | 46,660 | Xiaomi | MIT
132 | MiniMax M2.5 | 1,391 | ±4 | 41,241 | MiniMaxAI | Modified MIT
133 | GPT-5-mini (high) | 1,390 | ±5 | 27,016 | OpenAI | Proprietary
134 | OpenAI o4 - mini | 1,390 | ±4 | 45,433 | OpenAI | Proprietary
135 | Claude Sonnet 4 | 1,389 | ±4 | 40,294 | Anthropic | Proprietary
136 | OpenAI o1 | 1,388 | ±5 | 31,122 | OpenAI | Proprietary
137 | Qwen3-Coder-480B-A35B | 1,388 | ±5 | 25,709 | Alibaba | Apache 2.0
138 | mimo-v2-flash (thinking) | 1,387 | ±6 | 10,955 | Xiaomi | MIT
139 | Claude Sonnet 3.7 (thinking-32k) | 1,387 | ±4 | 38,819 | Anthropic | Proprietary
140 | Hunyuan-T1 | 1,387 | ±9 | 4,701 | Tencent AI Lab | Proprietary
141 | mistral-medium-2505 | 1,387 | ±5 | 33,206 | Mistral | Proprietary
142 | M2.1 | 1,384 | ±5 | 17,112 | MiniMaxAI | MIT
143 | Qwen3-30B-A3B-2507 | 1,383 | ±5 | 23,720 | Alibaba | Apache 2.0
144 | GPT-4.1 mini | 1,383 | ±4 | 39,308 | OpenAI | Proprietary
145 | hunyuan-turbos-20250416 | 1,382 | ±6 | 10,722 | Tencent | Proprietary
146 | Gemini 2.5 Flash-Lite-Preview-09-2025 (no-thinking) | 1,380 | ±3 | 47,204 | Google Deep Mind | Proprietary
147 | trinity-large-preview | 1,379 | ±4 | 30,115 | Arcee AI | Apache 2.0
148 | GLM-4.6V | 1,377 | ±11 | 2,802 | Zhipu AI | MIT
149 | Qwen3-235B-A22B | 1,375 | ±5 | 26,259 | Alibaba | Apache 2.0
150 | Gemini 2.5 Flash-Lite (thinking) | 1,374 | ±5 | 32,886 | Google Deep Mind | Proprietary
151 | Qwen2.5-Max | 1,374 | ±4 | 32,617 | Alibaba | Proprietary
152 | GLM-4.5-Air | 1,373 | ±4 | 31,066 | Zhipu AI | MIT
153 | Claude 3.5 Sonnet | 1,373 | ±3 | 88,333 | Anthropic | Proprietary
154 | Claude Sonnet 3.7 | 1,371 | ±4 | 43,169 | Anthropic | Proprietary
155 | Qwen3-Next (thinking) | 1,370 | ±6 | 13,686 | Alibaba | Apache 2.0
156 | trinity-large-thinking | 1,369 | ±5 | 29,229 | Arcee AI | Apache 2.0
157 | GLM-4.7-Flash | 1,368 | ±6 | 11,724 | Zhipu AI | MIT
158 | amazon-nova-experimental-chat-11-10 | 1,367 | ±4 | 25,376 | Amazon | Proprietary
159 | Gemma 3 - 27B (IT) | 1,366 | ±4 | 47,513 | Google Deep Mind | Gemma
160 | minimax-m1 | 1,364 | ±4 | 35,177 | MiniMax | Apache 2.0
161 | OpenAI o3-mini (high) | 1,363 | ±5 | 18,589 | OpenAI | Proprietary
162 | OpenAI o3-mini (high) | 1,362 | ±5 | 16,953 | OpenAI | Proprietary
163 | nvidia-nemotron-3-super-120b-a12b | 1,362 | ±7 | 7,547 | Nvidia | NVIDIA Open Model
164 | Gemini 2.0 Flash Experimental | 1,360 | ±4 | 43,740 | DeepMind | Proprietary
165 | DeepSeek-V3 | 1,358 | ±5 | 21,770 | DeepSeek-AI | DeepSeek
166 | Mistral-Small-3.2 | 1,358 | ±5 | 17,698 | MistralAI | Apache 2.0
167 | grok-3-mini-beta | 1,357 | ±5 | 22,703 | xAI | Proprietary
168 | intellect-3 | 1,356 | ±8 | 5,329 | Prime Intellect | MIT
169 | C4AI Command A (202503) | 1,354 | ±3 | 56,234 | CohereAI | CC-BY-NC-4.0
170 | GLM-4.5V | 1,354 | ±8 | 4,956 | Zhipu AI | MIT
171 | Gemini 2.0 Flash-Lite | 1,353 | ±4 | 24,955 | DeepMind | Proprietary
172 | GPT OSS 120B | 1,353 | ±4 | 30,639 | OpenAI | Apache 2.0
173 | Gemini 1.5 Pro | 1,351 | ±3 | 55,606 | Google Deep Mind | Proprietary
174 | amazon-nova-experimental-chat-10-20 | 1,350 | ±6 | 11,469 | Amazon | Proprietary
175 | hunyuan-turbos-20250226 | 1,349 | ±12 | 2,220 | Tencent | Proprietary
176 | Step3 | 1,348 | ±7 | 6,537 | StepFunAI | Apache 2.0
177 | amazon-nova-experimental-chat-10-09 | 1,348 | ±11 | 2,832 | Amazon | Proprietary
178 | OpenAI o3-mini | 1,348 | ±4 | 57,325 | OpenAI | Proprietary
179 | llama-3.1-nemotron-ultra-253b-v1 | 1,347 | ±12 | 2,549 | Nvidia | Nvidia Open Model
180 | Qwen3-32B | 1,347 | ±9 | 3,926 | Alibaba | Apache 2.0
181 | mercury-2 | 1,347 | ±11 | 3,123 | Inception AI | Proprietary
182 | ling-flash-2.0 | 1,346 | ±7 | 7,003 | InclusionAI | MIT
183 | qwen-plus-0125 | 1,346 | ±8 | 5,819 | Alibaba | Proprietary
184 | MiniMax M2 | 1,346 | ±8 | 6,864 | MiniMaxAI | Apache 2.0
185 | GPT-4o | 1,346 | ±3 | 112,881 | OpenAI | Proprietary
186 | nvidia-llama-3.3-nemotron-super-49b-v1.5 | 1,343 | ±10 | 3,346 | Nvidia | Nvidia Open
187 | glm-4-plus-0111 | 1,343 | ±8 | 5,760 | Zhipu | Proprietary
188 | Claude 3.5 Sonnet | 1,342 | ±3 | 82,419 | Anthropic | Proprietary
189 | Gemma 3 - 12B (IT) | 1,342 | ±10 | 3,829 | Google Deep Mind | Gemma
190 | hunyuan-turbo-0110 | 1,341 | ±12 | 2,290 | Tencent | Proprietary
191 | GPT-5-Nano (high) | 1,337 | ±7 | 8,257 | OpenAI | Proprietary
192 | OpenAI o1-mini | 1,337 | ±4 | 51,981 | OpenAI | Proprietary
193 | Nova 2 Lite | 1,337 | ±6 | 12,233 | Amazon | Proprietary
194 | QwQ-32B | 1,336 | ±4 | 25,382 | Alibaba | Apache 2.0
195 | Grok 2 | 1,336 | ±4 | 63,498 | xAI | Proprietary
196 | gemini-advanced-0514 | 1,335 | ±5 | 50,148 | Google | Proprietary
197 | GPT-4o | 1,335 | ±4 | 45,499 | OpenAI | Proprietary
198 | llama-3.1-405b-instruct-bf16 | 1,335 | ±4 | 41,375 | Meta | Llama 3.1 Community
199 | step-2-16k-exp-202412 | 1,334 | ±9 | 4,833 | StepFun | Proprietary
200 | llama-3.1-405b-instruct-fp8 | 1,333 | ±4 | 59,656 | Meta | Llama 3.1 Community
201 | olmo-3.1-32b-instruct | 1,330 | ±6 | 12,217 | Ai2 | Apache 2.0
202 | molmo-2-8b | 1,328 | ±21 | 803 | Ai2 | Apache 2.0
203 | yi-lightning | 1,328 | ±5 | 27,332 | 01 AI | Proprietary
204 | llama-3.3-nemotron-49b-super-v1 | 1,328 | ±12 | 2,218 | Nvidia | Nvidia
205 | Qwen3-30B-A3B | 1,327 | ±5 | 26,474 | Alibaba | Apache 2.0
206 | Llama 4 Maverick Instruct | 1,327 | ±4 | 39,967 | Facebook AI Research | Llama 4
207 | hunyuan-large-2025-02-10 | 1,326 | ±10 | 3,738 | Tencent | Proprietary
208 | gpt-4-turbo-2024-04-09 | 1,324 | ±4 | 98,114 | OpenAI | Proprietary
209 | Claude 3.5 Haiku | 1,324 | ±3 | 69,947 | Anthropic | Proprietary
210 | Gemini 1.5 Pro | 1,324 | ±4 | 79,138 | Google Deep Mind | Proprietary
211 | deepseek-v2.5-1210 | 1,323 | ±8 | 6,795 | DeepSeek | DeepSeek
212 | Llama 4 Scout Instruct | 1,323 | ±5 | 30,275 | Facebook AI Research | Llama
213 | GPT-4.1 nano | 1,322 | ±8 | 6,103 | OpenAI | Proprietary
214 | Claude3-Opus | 1,321 | ±3 | 194,909 | Anthropic | Proprietary
215 | ring-flash-2.0 | 1,321 | ±7 | 7,137 | InclusionAI | MIT
216 | step-1o-turbo-202506 | 1,320 | ±7 | 9,033 | StepFun | Proprietary
217 | glm-4-plus | 1,319 | ±5 | 26,126 | Zhipu AI | Proprietary
218 | Llama3.3-70B-Instruct | 1,318 | ±3 | 54,732 | Facebook AI Research | Llama-3.3
219 | Gemma-3n-E4B | 1,318 | ±5 | 22,578 | Google Deep Mind | Gemma
220 | qwen-max-0919 | 1,318 | ±6 | 16,478 | Alibaba | Qwen
221 | GPT-4o mini | 1,318 | ±4 | 68,709 | OpenAI | Proprietary
222 | GPT OSS 20B | 1,317 | ±6 | 10,625 | OpenAI | Apache 2.0
223 | nvidia-nemotron-3-nano-30b-a3b-bf16 | 1,316 | ±6 | 15,509 | Nvidia | NVIDIA Open Model
224 | qwen2.5-plus-1127 | 1,315 | ±6 | 10,187 | Alibaba | Proprietary
225 | athene-v2-chat | 1,314 | ±5 | 24,739 | NexusFlow | NexusFlow
226 | mistral-large-2407 | 1,314 | ±4 | 45,459 | Mistral | Mistral Research
227 | GPT-4 | 1,313 | ±4 | 93,439 | OpenAI | Proprietary
228 | GPT-4 | 1,312 | ±4 | 100,105 | OpenAI | Proprietary
229 | hunyuan-standard-2025-02-10 | 1,311 | ±10 | 3,904 | Tencent | Proprietary
230 | gemini-1.5-flash-002 | 1,309 | ±4 | 34,902 | Google | Proprietary
231 | grok-2-mini-2024-08-13 | 1,308 | ±4 | 52,567 | xAI | Proprietary
232 | DeepSeek V2.5 | 1,307 | ±5 | 24,572 | DeepSeek-AI | DeepSeek
233 | granite-4.1-8b | 1,307 | ±10 | 4,063 | IBM | Apache 2.0
234 | athene-70b-0725 | 1,306 | ±6 | 19,621 | NexusFlow | CC-BY-NC-4.0
235 | mercury | 1,306 | ±14 | 1,955 | Inception AI | Proprietary
236 | olmo-3-32b-think | 1,306 | ±8 | 5,942 | Ai2 | Apache 2.0
237 | mistral-large-2411 | 1,305 | ±4 | 28,073 | Mistral | MRL
238 | Magistral-Medium-2506 | 1,304 | ±6 | 11,630 | MistralAI | Proprietary
239 | Mistral-Small-3.1-24B-Instruct-2503 | 1,303 | ±5 | 33,204 | MistralAI | Apache 2.0
240 | Gemma 3 - 4B (IT) | 1,303 | ±9 | 4,171 | Google Deep Mind | Gemma
241 | Qwen2.5-VL-72B-Instruct | 1,303 | ±4 | 39,406 | Alibaba | Qwen
242 | Llama3.1-70B-Instruct | 1,299 | ±8 | 7,140 | Facebook AI Research | Llama 3.1
243 | hunyuan-large-vision | 1,294 | ±9 | 5,369 | Tencent | Proprietary
244 | Llama3.1-70B-Instruct | 1,293 | ±4 | 55,240 | Facebook AI Research | Llama 3.1 Community
245 | amazon-nova-pro-v1.0 | 1,290 | ±5 | 24,745 | Amazon | Proprietary
246 | jamba-1.5-large | 1,289 | ±7 | 8,662 | AI21 Labs | Jamba Open
247 | gemma-2-27b-it | 1,289 | ±3 | 75,754 | Google | Gemma license
248 | reka-core-20240904 | 1,288 | ±7 | 7,312 | Reka AI | Proprietary
249 | ibm-granite-h-small | 1,287 | ±8 | 5,680 | IBM | Apache 2.0
250 | GPT-4 | 1,287 | ±5 | 54,173 | OpenAI | Proprietary
251 | gemini-1.5-flash-001 | 1,286 | ±5 | 62,833 | Google | Proprietary
252 | llama-3.1-tulu-3-70b | 1,286 | ±10 | 2,846 | Ai2 | Llama 3.1
253 | llama-3.1-nemotron-51b-instruct | 1,286 | ±10 | 3,749 | Nvidia | Llama 3.1
254 | olmo-3.1-32b-think | 1,285 | ±7 | 8,503 | Ai2 | Apache 2.0
255 | Claude3-Sonnet | 1,280 | ±4 | 109,284 | Anthropic | Proprietary
256 | gemma-2-9b-it-simpo | 1,280 | ±7 | 10,072 | Princeton | MIT
257 | nemotron-4-340b-instruct | 1,276 | ±5 | 19,659 | Nvidia | NVIDIA Open Model
258 | Llama3-70B-Instruct | 1,276 | ±4 | 156,876 | Facebook AI Research | Llama 3 Community
259 | command-r-plus-08-2024 | 1,276 | ±7 | 9,866 | Cohere | CC-BY-NC-4.0
260 | GPT-4 | 1,275 | ±4 | 88,723 | OpenAI | Proprietary
261 | Mistral Small 24B Instruct 2501 | 1,274 | ±6 | 14,681 | MistralAI | Apache 2.0
262 | GLM4 | 1,273 | ±7 | 9,788 | Zhipu AI | Proprietary
263 | reka-flash-20240904 | 1,272 | ±7 | 7,536 | Reka AI | Proprietary
264 | Qwen2.5-Coder-32B-Instruct | 1,270 | ±8 | 5,432 | Alibaba | Apache 2.0
265 | C4AI Aya Vision 32B | 1,267 | ±5 | 27,124 | CohereAI | CC-BY-NC-4.0
266 | gemma-2-9b-it | 1,266 | ±4 | 54,611 | Google | Gemma license
267 | deepseek-coder-v2 | 1,264 | ±6 | 15,147 | DeepSeek | DeepSeek License
268 | Qwen2-72B-Instruct | 1,261 | ±5 | 37,325 | Alibaba | Qianwen LICENSE
269 | C4AI Command R+ | 1,261 | ±4 | 77,554 | CohereAI | CC-BY-NC-4.0
270 | Claude3-Haiku | 1,261 | ±4 | 117,701 | Anthropic | Proprietary
271 | amazon-nova-lite-v1.0 | 1,260 | ±5 | 19,372 | Amazon | Proprietary
272 | gemini-1.5-flash-8b-001 | 1,259 | ±4 | 35,558 | Google | Proprietary
273 | Phi 4 - 14B | 1,256 | ±5 | 24,126 | Microsoft Azure | MIT
274 | olmo-2-0325-32b-instruct | 1,251 | ±11 | 3,334 | Ai2 | Apache-2.0
275 | command-r-08-2024 | 1,250 | ±7 | 10,140 | Cohere | CC-BY-NC-4.0
276 | mistral-large-2402 | 1,242 | ±5 | 62,436 | Mistral | Proprietary
277 | amazon-nova-micro-v1.0 | 1,241 | ±5 | 19,364 | Amazon | Proprietary
278 | jamba-1.5-mini | 1,239 | ±7 | 8,858 | AI21 Labs | Jamba Open
279 | ministral-8b-2410 | 1,237 | ±9 | 4,781 | Mistral | MRL
280 | gemini-pro-dev-api | 1,236 | ±7 | 18,354 | Google | Proprietary
281 | Qwen1.5-110B-Chat | 1,233 | ±6 | 26,195 | Alibaba | Qianwen LICENSE
282 | hunyuan-standard-256k | 1,233 | ±12 | 2,728 | Tencent | Proprietary
283 | reka-flash-21b-20240226-online | 1,233 | ±7 | 15,450 | Reka AI | Proprietary
284 | Qwen1.5-72B-Chat | 1,233 | ±5 | 39,302 | Alibaba | Qianwen LICENSE
285 | Mixtral-8x22B-Instruct-v0.1 | 1,229 | ±5 | 51,416 | MistralAI | Apache 2.0
286 | command-r | 1,226 | ±5 | 54,036 | Cohere | CC-BY-NC-4.0
287 | reka-flash-21b-20240226 | 1,226 | ±6 | 24,806 | Reka AI | Proprietary
288 | gpt-3.5-turbo-0125 | 1,224 | ±5 | 66,207 | OpenAI | Proprietary
289 | Llama3-8B-Instruct | 1,223 | ±4 | 104,642 | Facebook AI Research | Llama 3 Community
290 | C4AI Aya Vision 8B | 1,223 | ±7 | 9,818 | CohereAI | CC-BY-NC-4.0
291 | Gemini-pro | 1,222 | ±12 | 6,390 | DeepMind | Proprietary
292 | mistral-medium | 1,222 | ±5 | 34,550 | Mistral | Proprietary
293 | llama-3.1-tulu-3-8b | 1,220 | ±11 | 2,896 | Ai2 | Llama 3.1
294 | Yi-1.5-34B | 1,212 | ±5 | 24,146 | 01.AI | Apache-2.0
295 | zephyr-orpo-141b-A35b-v0.1 | 1,212 | ±11 | 4,652 | HuggingFace | Apache 2.0
296 | Llama3.1-8B-Instruct | 1,211 | ±4 | 49,605 | Facebook AI Research | Llama 3.1 Community
297 | Llama3.1-8B-Instruct | 1,208 | ±11 | 3,090 | Facebook AI Research | Apache 2.0
298 | qwen1.5-32b-chat | 1,203 | ±6 | 21,741 | Alibaba | Qianwen LICENSE
299 | gpt-3.5-turbo-1106 | 1,202 | ±9 | 16,619 | OpenAI | Proprietary
300 | gemma-2-2b-it | 1,200 | ±4 | 46,616 | Google | Gemma license
301 | Phi-3-medium 14B-preview | 1,197 | ±5 | 25,055 | Microsoft Azure | MIT
302 | mixtral-8x7b-instruct-v0.1 | 1,196 | ±4 | 73,503 | Mistral | Apache 2.0
303 | DBRX Instruct | 1,194 | ±6 | 32,191 | databricks | DBRX LICENSE
304 | InternLM2-Base-20B | 1,191 | ±7 | 9,901 | Shanghai AI Laboratory | Other
305 | Qwen1.5-14B-Chat | 1,190 | ±7 | 17,839 | Alibaba | Qianwen LICENSE
306 | WizardLM-70B-V1.0 | 1,184 | ±9 | 8,214 | WizardLM Team | Llama 2 Community
307 | DeepSeek LLM 67B Chat | 1,184 | ±11 | 4,932 | DeepSeek-AI | DeepSeek License
308 | Yi-34B | 1,183 | ±7 | 15,483 | 01.AI | Yi License
309 | granite-3.0-8b-instruct | 1,182 | ±9 | 6,638 | IBM | Apache 2.0
310 | openchat-3.5 | 1,182 | ±10 | 7,968 | OpenChat | Apache-2.0
311 | openchat-3.5-0106 | 1,182 | ±8 | 12,637 | OpenChat | Apache-2.0
312 | Gemma 1.1-7B-IT | 1,181 | ±6 | 23,893 | Google Research | Gemma license
313 | snowflake-arctic-instruct | 1,179 | ±6 | 32,832 | Snowflake | Apache 2.0
314 | granite-3.1-2b-instruct | 1,178 | ±11 | 3,188 | IBM | Apache 2.0
315 | tulu-2-dpo-70b | 1,177 | ±10 | 6,535 | AllenAI/UW | AI2 ImpACT Low-risk
316 | openhermes-2.5-mistral-7b | 1,175 | ±10 | 5,006 | NousResearch | Apache-2.0
317 | Vicuna 33B | 1,172 | ±6 | 22,479 | LM-SYS | Non-commercial
318 | starling-lm-7b-beta | 1,171 | ±7 | 16,056 | Nexusflow | Apache-2.0
319 | Phi-3-small 7B | 1,170 | ±6 | 17,766 | Microsoft Azure | MIT
320 | llama-2-70b-chat | 1,170 | ±6 | 38,492 | Meta | Llama 2 Community
321 | starling-lm-7b-alpha | 1,167 | ±8 | 10,224 | UC Berkeley | CC-BY-NC-4.0
322 | llama-3.2-3b-instruct | 1,166 | ±8 | 7,936 | Meta | Llama 3.2
323 | nous-hermes-2-mixtral-8x7b-dpo | 1,164 | ±12 | 3,777 | NousResearch | Apache-2.0
324 | Qwen3-VL-2B | 1,156 | ±8 | 6,837 | Alibaba | Apache 2.0
325 | QwQ-32B-Preview | 1,155 | ±12 | 3,231 | Alibaba | Apache 2.0
326 | llama2-70b-steerlm-chat | 1,154 | ±13 | 3,585 | Nvidia | Llama 2 Community
327 | solar-10.7b-instruct-v1.0 | 1,151 | ±13 | 4,155 | Upstage AI | CC-BY-NC-4.0
328 | dolphin-2.2.1-mistral-7b | 1,151 | ±15 | 1,679 | Cognitive Computations | Apache-2.0
329 | MPT-30B-Chat | 1,150 | ±12 | 2,572 | MosaicML | CC-BY-NC-SA-4.0
330 | Mistral-7B-Instruct-v0.2 | 1,149 | ±7 | 19,402 | MistralAI | Apache-2.0
331 | wizardlm-13b | 1,148 | ±9 | 7,044 | Microsoft | Llama 2 Community
332 | falcon-180b-chat | 1,147 | ±17 | 1,295 | TII | Falcon-180B TII License
333 | Qwen1.5-7B-Chat | 1,143 | ±10 | 4,737 | Alibaba | Qianwen LICENSE
334 | Phi-3-mini 3.8B | 1,142 | ±6 | 12,297 | Microsoft Azure | MIT
335 | Baichuan2-13B-Chat | 1,141 | ±7 | 19,174 | Baichuan AI | Llama 2 Community
336 | Vicuna 13B | 1,140 | ±7 | 19,367 | LM-SYS | Llama 2 Community
337 | Qwen-14B-Chat | 1,138 | ±11 | 4,964 | Alibaba | Qianwen LICENSE
338 | PaLM 2 | 1,137 | ±9 | 8,554 | Google Research | Proprietary
339 | Gemma 7B - It | 1,137 | ±9 | 8,925 | Google Research | Gemma license
340 | CodeLLaMA-34B | 1,136 | ±9 | 7,366 | Facebook AI Research | Llama 2 Community
341 | zephyr-7b-beta | 1,130 | ±9 | 11,118 | HuggingFace | MIT
342 | Phi-3-mini 3.8B | 1,129 | ±7 | 20,685 | Microsoft Azure | MIT
343 | Phi-3-mini 3.8B | 1,127 | ±6 | 20,118 | Microsoft Azure | MIT
344 | guanaco-33b | 1,126 | ±12 | 2,921 | UW | Non-commercial
345 | zephyr-7b-alpha | 1,126 | ±16 | 1,785 | HuggingFace | MIT
346 | stripedhyena-nous-7b | 1,120 | ±11 | 5,182 | Together AI | Apache 2.0
347 | CodeLlama-70B-Instruct | 1,118 | ±18 | 1,143 | Facebook AI Research | Llama 2 Community
348 | Gemma 1.1-2B-IT | 1,115 | ±8 | 10,854 | Google Research | Gemma license
349 | Vicuna 7B | 1,114 | ±9 | 6,923 | LM-SYS | Llama 2 Community
350 | smollm2-1.7b-instruct | 1,114 | ±14 | 2,199 | HuggingFace | Apache 2.0
351 | llama-3.2-1b-instruct | 1,110 | ±8 | 8,045 | Meta | Llama 3.2
352 | Mistral 7B Instruct | 1,109 | ±9 | 8,977 | MistralAI | Apache 2.0
353 | Baichuan2-7B-Chat | 1,107 | ±7 | 14,148 | Baichuan AI | Llama 2 Community
354 | Gemma 2B - It | 1,092 | ±11 | 4,780 | Google Research | Gemma license
355 | Qwen1.5-4B-Chat | 1,090 | ±9 | 7,597 | Alibaba | Qianwen LICENSE
356 | olmo-7b-instruct | 1,073 | ±11 | 6,328 | Ai2 | Apache-2.0
357 | Koala | 1,070 | ±10 | 6,965 | DAMO Academy | Non-commercial
358 | alpaca-13b | 1,068 | ±11 | 5,745 | Stanford | Non-commercial
359 | GPT4All 13B | 1,066 | ±15 | 1,743 | Nomic AI | Non-commercial
360 | MPT-7B-Chat | 1,062 | ±12 | 3,924 | MosaicML | CC-BY-NC-SA-4.0
361 | ChatGLM3-6B | 1,055 | ±12 | 4,658 | Zhipu AI | Apache-2.0
362 | RWKV-4-Raven-14B | 1,041 | ±11 | 4,845 | RWKV | Apache 2.0
363 | ChatGLM2-6B | 1,023 | ±14 | 2,658 | Zhipu AI | Apache-2.0
364 | oasst-pythia-12b | 1,022 | ±11 | 6,310 | OpenAssistant | Apache 2.0
365 | ChatGLM-6B | 995 | ±13 | 4,914 | Zhipu AI | Non-commercial
366 | fastchat-t5-3b | 991 | ±12 | 4,203 | LMSYS | Apache 2.0
367 | dolly-v2-12b | 980 | ±14 | 3,412 | Databricks | MIT
368 | LLaMA 13B | 973 | ±16 | 2,391 | Facebook AI Research | Non-commercial
369 | stablelm-tuned-alpha-7b | 952 | ±13 | 3,287 | Stability AI | CC-BY-NC-SA-4.0

Data is for reference only; the official source takes precedence.

Frequently Asked Questions (FAQ)

01

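What is Text Generation Arena (LMArena)?

Text Generation Arena (formerly LMSYS Chatbot Arena) is currently the most influential anonymous evaluation platform for large models. Users pose questions to two models whose identities are hidden and vote on answer quality. The system uses the Elo algorithm to aggregate millions of votes into a dynamic leaderboard that is widely cited in academia and industry.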

02

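How is the Arena Elo score calculated?

The Elo algorithm originates from the chess rating system. After each battle the winner's score rises and the loser's falls, by an amount that depends on the gap between their prior ratings: an upset win over a higher-rated model moves the scores more than an expected win does. The 95% confidence interval (CI) reflects how many battles the model has taken part in: a narrower CI means more data and a more trustworthy ranking.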
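The per-battle update described above can be sketched as follows. This is the classic chess-style online update, shown only for illustration: the function name `elo_update`, the K-factor of 32, and the scale of 400 are assumptions, and the live leaderboard may instead fit all votes jointly (for example with a Bradley-Terry model) rather than updating one battle at a time.

```python
def elo_update(r_winner: float, r_loser: float,
               k: float = 32.0, scale: float = 400.0) -> tuple[float, float]:
    """Return the new (winner, loser) ratings after one battle.

    k and scale are illustrative chess-style defaults, not values
    confirmed by the leaderboard.
    """
    # Probability the eventual winner was expected to win beforehand.
    expected_win = 1.0 / (1.0 + 10.0 ** ((r_loser - r_winner) / scale))
    # The less expected the win, the larger the rating transfer.
    delta = k * (1.0 - expected_win)
    return r_winner + delta, r_loser - delta


# Evenly matched models: the winner gains k/2 and the loser drops k/2.
even_w, even_l = elo_update(1500.0, 1500.0)

# Upset: a 1400-rated model beats a 1600-rated one and gains more than k/2.
upset_w, upset_l = elo_update(1400.0, 1600.0)
```

In this sketch the points gained by the winner always equal the points lost by the loser, so the total rating in the pool is conserved.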

03

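Why does the same model appear in both a "Thinking" version and a standard version?

Some models support an Extended Thinking mode, in which the model performs deeper internal reasoning before giving its final answer. This mode usually scores higher on logical reasoning, math, and programming tasks, but it also has longer response latency and higher cost. Arena evaluates the two modes separately so that users can choose according to their actual needs.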

04

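How do I use the leaderboard to choose a large language model that suits me?

Weigh several factors together: overall capability (the overall Elo score), cost (closed-source APIs are billed by usage, while open-source models can be self-hosted), Chinese-language support, degree of openness, and response speed.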