DataLearner 标志DataLearnerAI
最新AI资讯
大模型排行榜
大模型评测基准
大模型列表
大模型对比
资源中心
工具
语言中文
DataLearner 标志DataLearner AI

专注大模型评测、数据资源与实践教学的知识平台,持续更新可落地的 AI 能力图谱。

产品

  • 评测榜单
  • 模型对比
  • 数据资源

资源

  • 部署教程
  • 原创内容
  • 工具导航

关于

  • 关于我们
  • 隐私政策
  • 数据收集方法
  • 联系我们

© 2026 DataLearner AI. DataLearner 持续整合行业数据与案例,为科研、企业与开发者提供可靠的大模型情报与实践指南。

隐私政策服务条款
首页综合排行榜Text Generation Arena 文本生成模型排行榜

LMArena 评测赛道

文本生成代码数学图像编辑文字生成视频图生视频文生图

Text Generation Arena 文本生成模型排行榜

基于 Text Generation Arena 用户匿名投票的最新AI文本生成模型排行榜,涵盖各模型的 Elo 得分、95% 置信区间、投票量、机构与许可证。

榜首模型

Opus 4.7 (thinking)

最高得分

1,503

模型数量

357

数据版本

2026年05月07日

数据来源: LM Arena

关于本排行榜

本排行榜展示了当前最强 AI 大模型在文本生成任务中的综合实力排名。数据来源于 LMArena(前身为 LMSYS Chatbot Arena),这是目前全球最大的 AI 模型众包评测平台。用户在平台上与两个匿名模型同时对话,并投票选出更好的回答——排名完全由真实用户的偏好决定,而非实验室基准测试。

评测方法概要

匿名盲测:用户同时与两个"隐藏身份"的模型对话,根据回答质量投票,排除品牌偏见。

Elo 评分:基于国际象棋领域的 Elo Rating 体系(Bradley-Terry 模型),通过对战结果计算每个模型的实力分数。分数越高,说明模型在真实对话中被用户选中的概率越大。

场景覆盖广泛:涵盖编程、创意写作、数学推理、知识问答、角色扮演等高频真实场景。

DataLearner 在原始数据基础上提供中文解读与深度分析,并将排行榜模型关联至 DataLearner 模型库,方便您一键查看模型详情、API 定价、评测得分等完整信息。

来源:全部国产模型
榜单历史快照月份:

排名总表

排名模型名称得分95% CI投票数机构许可证
AnthropicOpus 4.7 (thinking)Anthropic1,503+/-68,945AnthropicProprietary
AnthropicClaude Opus 4.6 (thinking)Anthropic1,502+/-523,616AnthropicProprietary
AnthropicClaude Opus 4.6Anthropic1,498+/-525,089AnthropicProprietary
4Google Deep MindGemini 3.1 Pro PreviewGoogle Deep Mind1,492+/-429,468Google Deep MindProprietary
5AnthropicOpus 4.7Anthropic1,491+/-69,614AnthropicProprietary
6FAMuse SparkFacebook AI研究实验室1,490+/-610,491Facebook AI研究实验室Proprietary
7Google Deep MindGemini 3.0 Pro (Preview 11-2025)Google Deep Mind1,486+/-441,381Google Deep MindProprietary
8OpenAIgpt-5.5-highOpenAI1,484+/-76,488OpenAIProprietary
9xAIgrok-4.20-beta1xAI1,480+/-518,791xAIProprietary
10OpenAIgpt-5.2-chat-latest-20260210OpenAI1,477+/-523,717OpenAIProprietary
11OpenAIgpt-5.4-highOpenAI1,477+/-517,146OpenAIProprietary
12xAIgrok-4.20-beta-0309-reasoningxAI1,477+/-517,538xAIProprietary
13OpenAIGPT-5.5OpenAI1,475+/-76,653OpenAIProprietary
14Baiduernie-5.1Baidu1,474+/-85,733BaiduProprietary
15xAIgrok-4.20-multi-agent-beta-0309xAI1,474+/-517,728xAIProprietary
16Google Deep MindGemini 3.0 FlashGoogle Deep Mind1,474+/-430,784Google Deep MindProprietary
17AnthropicClaude Opus 4 (thinking-32k)Anthropic1,473+/-437,168AnthropicProprietary
18OpenAIgpt-5.5-instantOpenAI1,473+/-112,833OpenAIProprietary
19智谱GLM 5.1智谱AI1,471+/-611,349智谱AIMIT
20AnthropicClaude Opus 4Anthropic1,468+/-354,886AnthropicProprietary
21OpenAIGPT-5.4OpenAI1,468+/-517,925OpenAIProprietary
22xAIGrok 4.1 ThinkingxAI1,467+/-355,257xAIProprietary
23AnthropicClaude Sonnet 4.6Anthropic1,466+/-517,127AnthropicProprietary
24XImimo-v2.5-proXiaomi1,464+/-76,238XiaomiMIT
25Alibabaqwen3.5-max-previewAlibaba1,464+/-514,558AlibabaProprietary
26Google Deep MindGemini 3.0 Flash (minimal)Google Deep Mind1,463+/-441,346Google Deep MindProprietary
27DeepSeek-AIDeepSeek-V4-ProDeepSeek-AI1,463+/-94,160DeepSeek-AIMIT
28Moonshot AIKimi K2.6Moonshot AI1,462+/-77,108Moonshot AIModified MIT
29DeepSeekdeepseek-v4-pro-thinkingDeepSeek1,462+/-93,808DeepSeekMIT
30xAIGrok 4.1xAI1,459+/-359,206xAIProprietary
31Bytedancedola-seed-2.0-proBytedance1,459+/-526,587BytedanceProprietary
32阿里Qwen3.6-Max-Preview阿里巴巴1,457+/-93,965阿里巴巴Proprietary
33智谱GLM-5智谱AI1,457+/-520,292智谱AIMIT
34OpenAIgpt-5.4-mini-highOpenAI1,456+/-514,952OpenAIProprietary
35xAIGrok 4.3 BetaxAI1,455+/-85,234xAIProprietary
36OpenAIGPT-5.1 Pro (high)OpenAI1,455+/-440,891OpenAIProprietary
37AnthropicClaude Sonnet 4.5 (thinking-32k)Anthropic1,454+/-367,180AnthropicProprietary
38AnthropicClaude Sonnet 4.5Anthropic1,453+/-365,214AnthropicProprietary
39DeepMindGemma 4 31BDeepMind1,451+/-85,827DeepMindApache 2.0
40百度ERNIE 5.0百度1,450+/-428,724百度Proprietary
41Moonshot AIKimi K2 ThinkingMoonshot AI1,449+/-427,282Moonshot AIModified MIT
42百度ERNIE 5.0百度1,449+/-79,764百度Proprietary
43AnthropicOpus 4.1 (thinking-16k)Anthropic1,449+/-349,850AnthropicProprietary
44OpenAIgpt-5.3-chat-latestOpenAI1,449+/-522,474OpenAIProprietary
45Google Deep MindGemini 2.5 Pro Experimental 03-25Google Deep Mind1,448+/-3114,865Google Deep MindProprietary
46阿里Qwen 3.6 Plus Preview阿里巴巴1,448+/-68,683阿里巴巴Proprietary
47AnthropicOpus 4.1Anthropic1,447+/-377,425AnthropicProprietary
48XImimo-v2-proXiaomi1,447+/-515,257XiaomiProprietary
49阿里Qwen3.5-397B-A17B阿里巴巴1,446+/-522,471阿里巴巴Apache 2.0
50OpenAIGPT-4.5OpenAI1,444+/-614,547OpenAIProprietary
51OpenAIchatgpt-4o-latest-20250326OpenAI1,443+/-382,527OpenAIProprietary
52智谱GLM-4.7智谱AI1,443+/-612,142智谱AIMIT
53DeepSeekdeepseek-v4-flash-thinkingDeepSeek1,440+/-93,600DeepSeekMIT
54OpenAIGPT-5.2 Pro (high)OpenAI1,440+/-438,067OpenAIProprietary
55OpenAIGPT-5.1 InstantOpenAI1,439+/-443,533OpenAIProprietary
56Googlegemini-3.1-flash-lite-previewGoogle1,438+/-523,715GoogleProprietary
57DeepMindGemma 4 26B A4BDeepMind1,438+/-85,782DeepMindApache 2.0
58OpenAIGPT-5.2OpenAI1,437+/-435,182OpenAIProprietary
59阿里Qwen3 Max (Preview)阿里巴巴1,435+/-527,743阿里巴巴Proprietary
60Meituanlongcat-flash-chat-2602-expMeituan1,434+/-613,311MeituanProprietary
61OpenAIGPT-5-Pro (high)OpenAI1,434+/-531,963OpenAIProprietary
62DeepSeek-AIDeepSeek-V4-FlashDeepSeek-AI1,433+/-93,506DeepSeek-AIMIT
63Moonshotkimi-k2.5-instantMoonshot1,432+/-78,207MoonshotModified MIT
64xAIgrok-4-1-fast-reasoningxAI1,432+/-350,028xAIProprietary
65OpenAIOpenAI o3OpenAI1,431+/-459,783OpenAIProprietary
66Moonshot AIKimi K2 Thinking (thinking-turbo)Moonshot AI1,430+/-352,935Moonshot AIModified MIT
67Amazonamazon-nova-experimental-chat-26-02-10Amazon1,428+/-103,424AmazonProprietary
68OpenAIGPT-5OpenAI1,426+/-431,617OpenAIProprietary
69智谱GLM-4.6智谱AI1,426+/-435,694智谱AIMIT
70DeepSeek-AIDeepSeek V3.2-Exp (thinking)DeepSeek-AI1,425+/-79,076DeepSeek-AIMIT
71DeepSeek-AIDeepSeek V3.2DeepSeek-AI1,424+/-444,820DeepSeek-AIMIT
72Alibabaqwen3-max-2025-09-23Alibaba1,424+/-69,179AlibabaProprietary
73AnthropicClaude Opus 4 (thinking-16k)Anthropic1,424+/-436,937AnthropicProprietary
74DeepSeek-AIDeepSeek V3.2-ExpDeepSeek-AI1,423+/-611,943DeepSeek-AIMIT
75XImimo-v2.5Xiaomi1,423+/-76,300XiaomiMIT
76阿里Qwen3-235B-A22B-2507阿里巴巴1,423+/-388,518阿里巴巴Apache 2.0
77DeepSeek-AIDeepSeek V3.2 (thinking)DeepSeek-AI1,422+/-439,071DeepSeek-AIMIT
78DeepSeek-AIDeepSeek-R1-0528DeepSeek-AI1,422+/-618,469DeepSeek-AIMIT
79xAIGrok 4 FastxAI1,421+/-86,823xAIProprietary
80百度ERNIE 5.0百度1,419+/-94,715百度Proprietary
81阿里Qwen3.5-122B-A10B阿里巴巴1,418+/-519,379阿里巴巴Apache 2.0
82Tencenthunyuan-hy3-previewTencent1,418+/-84,582Tencenttencent-hunyuan-community
83Moonshot AIKimi K2 0905Moonshot AI1,418+/-611,798Moonshot AIModified MIT
84DeepSeek-AIDeepSeek-V3.1DeepSeek-AI1,418+/-614,985DeepSeek-AIMIT
85Moonshot AIKimi K2Moonshot AI1,417+/-527,644Moonshot AIModified MIT
86DeepSeekdeepseek-v3.1-terminus-thinkingDeepSeek1,417+/-103,474DeepSeekMIT
87DeepSeek-AIDeepSeek-V3.1 (thinking)DeepSeek-AI1,417+/-711,754DeepSeek-AIMIT
88DeepSeek-AIDeepSeek-V3.1 TerminusDeepSeek-AI1,416+/-103,713DeepSeek-AIMIT
89阿里Qwen3-VL-235B-A22B-Instruct阿里巴巴1,415+/-611,529阿里巴巴Apache 2.0
90Amazonamazon-nova-experimental-chat-26-01-10Amazon1,415+/-103,418AmazonProprietary
91MistralAIMistral Large 3MistralAI1,415+/-441,365MistralAIApache 2.0
92OpenAIGPT-4.1OpenAI1,413+/-451,035OpenAIProprietary
93AnthropicClaude Opus 4Anthropic1,412+/-444,244AnthropicProprietary
94xAIGrok 3xAI1,412+/-432,916xAIProprietary
95智谱GLM-4.5智谱AI1,411+/-524,336智谱AIMIT
96Google Deep MindGemini 2.5 FlashGoogle Deep Mind1,411+/-3114,591Google Deep MindProprietary
97xAIGrok 4xAI1,410+/-441,416xAIProprietary
98MistralAIMagistral-Medium-2506MistralAI1,409+/-384,463MistralAIProprietary
99AnthropicHaiku 4.5Anthropic1,409+/-367,007AnthropicProprietary
100MiniMaxAIMiniMax-M2.7MiniMaxAI1,407+/-613,525MiniMaxAIModified MIT
101阿里Qwen3.5-27B阿里巴巴1,406+/-518,942阿里巴巴Apache 2.0
102OpenAIgpt-5.4-nano-highOpenAI1,406+/-514,363OpenAIProprietary
103Google Deep MindGemini 2.5 Flash-Preview-09-2025Google Deep Mind1,405+/-432,938Google Deep MindProprietary
104xAIgrok-4-fast-reasoningxAI1,404+/-518,737xAIProprietary
105Alibabaqwen3-235b-a22b-no-thinkingAlibaba1,403+/-538,241AlibabaApache 2.0
106阿里Qwen3-Next阿里巴巴1,402+/-522,883阿里巴巴Apache 2.0
107OpenAIo1-2024-12-17OpenAI1,402+/-427,807OpenAIProprietary
108Meituanlongcat-flash-chatMeituan1,401+/-611,409MeituanMIT
109Alibabaqwen3-235b-a22b-thinking-2507Alibaba1,399+/-79,004AlibabaApache 2.0
110AnthropicClaude Sonnet 4 (thinking-32k)Anthropic1,399+/-435,132AnthropicProprietary
111StepFunAIStep 3.5 FlashStepFunAI1,398+/-519,649StepFunAIProprietary
112DeepSeek-AIDeepSeek-R1DeepSeek-AI1,398+/-518,524DeepSeek-AIMIT
113阿里Qwen3.5-35B-A3B阿里巴巴1,397+/-519,774阿里巴巴Apache 2.0
114Tencenthunyuan-vision-1.5-thinkingTencent1,396+/-122,221TencentProprietary
115阿里Qwen3-VL-235B-A22B-Instruct (thinking)阿里巴巴1,396+/-77,944阿里巴巴Apache 2.0
116Amazonamazon-nova-experimental-chat-12-10Amazon1,395+/-103,690AmazonProprietary
117DeepSeek-AIDeepSeek-V3-0324DeepSeek-AI1,395+/-445,533DeepSeek-AIMIT
118MiniMaxAIMiniMax M2.5MiniMaxAI1,395+/-424,885MiniMaxAIModified MIT
119StepFunAIStep 3.5 FlashStepFunAI1,393+/-425,112StepFunAIApache 2.0
120XImimo-v2-flash (non-thinking)Xiaomi1,393+/-437,247XiaomiMIT
121Microsoft AImai-1-previewMicrosoft AI1,393+/-517,899Microsoft AIProprietary
122OpenAIgpt-5-mini-highOpenAI1,390+/-527,053OpenAIProprietary
123OpenAIOpenAI o4 - miniOpenAI1,390+/-445,463OpenAIProprietary
124AnthropicClaude Sonnet 4Anthropic1,389+/-440,351AnthropicProprietary
125OpenAIOpenAI o1OpenAI1,388+/-531,122OpenAIProprietary
126XImimo-v2-flash (thinking)Xiaomi1,388+/-610,982XiaomiMIT
127腾讯Hunyuan-T1腾讯AI实验室1,387+/-94,710腾讯AI实验室Proprietary
128阿里Qwen3-Coder-480B-A35B阿里巴巴1,387+/-525,757阿里巴巴Apache 2.0
129AnthropicClaude Sonnet 3.7 (thinking-32k)Anthropic1,387+/-438,841AnthropicProprietary
130Mistralmistral-medium-2505Mistral1,386+/-533,244MistralProprietary
131MiniMaxAIM2.1MiniMaxAI1,385+/-517,165MiniMaxAIMIT
132阿里Qwen3-30B-A3B-2507阿里巴巴1,383+/-523,766阿里巴巴Apache 2.0
133OpenAIGPT-4.1 miniOpenAI1,382+/-439,353OpenAIProprietary
134Tencenthunyuan-turbos-20250416Tencent1,382+/-610,723TencentProprietary
135ARtrinity-large-thinkingArcee AI1,380+/-612,239Arcee AIApache 2.0
136Google Deep MindGemini 2.5 Flash-Lite-Preview-09-2025 (no-thinking)Google Deep Mind1,380+/-347,285Google Deep MindProprietary
137智谱GLM-4.6V智谱AI1,378+/-112,810智谱AIMIT
138ARtrinity-large-previewArcee AI1,375+/-520,978Arcee AIApache 2.0
139阿里Qwen3-235B-A22B阿里巴巴1,375+/-526,284阿里巴巴Apache 2.0
140Google Deep MindGemini 2.5 Flash-Lite (thinking)Google Deep Mind1,374+/-532,947Google Deep MindProprietary
141阿里Qwen2.5-Max阿里巴巴1,374+/-432,625阿里巴巴Proprietary
142智谱GLM-4.5-Air智谱AI1,373+/-431,119智谱AIMIT
143AnthropicClaude 3.5 SonnetAnthropic1,372+/-388,359AnthropicProprietary
144AnthropicClaude Sonnet 3.7Anthropic1,371+/-443,206AnthropicProprietary
145阿里Qwen3-Next (thinking)阿里巴巴1,369+/-613,707阿里巴巴Apache 2.0
146智谱GLM-4.7-Flash智谱AI1,368+/-611,763智谱AIMIT
147Amazonamazon-nova-experimental-chat-11-10Amazon1,367+/-425,445AmazonProprietary
148Google Deep MindGemma 3 - 27B (IT)Google Deep Mind1,366+/-447,569Google Deep MindGemma
149MiniMaxminimax-m1MiniMax1,363+/-435,233MiniMaxApache 2.0
150OpenAIo3-mini-highOpenAI1,363+/-518,589OpenAIProprietary
151OpenAIOpenAI o3-mini (high)OpenAI1,362+/-516,977OpenAIProprietary
152Nvidianvidia-nemotron-3-super-120b-a12bNvidia1,361+/-77,419NvidiaNVIDIA Open Model
153DeepMindGemini 2.0 Flash ExperimentalDeepMind1,360+/-443,767DeepMindProprietary
154DeepSeek-AIDeepSeek-V3DeepSeek-AI1,358+/-521,770DeepSeek-AIDeepSeek
155MistralAIMistral-Small-3.2MistralAI1,357+/-517,716MistralAIApache 2.0
156xAIgrok-3-mini-betaxAI1,357+/-522,724xAIProprietary
157PRintellect-3Prime Intellect1,357+/-85,337Prime IntellectMIT
158CohereAIC4AI Command A (202503)CohereAI1,353+/-356,304CohereAICC-BY-NC-4.0
159智谱GLM-4.5V智谱AI1,353+/-84,965智谱AIMIT
160DeepMindGemini 2.0 Flash-LiteDeepMind1,353+/-424,955DeepMindProprietary
161OpenAIGPT OSS 120BOpenAI1,353+/-430,653OpenAIApache 2.0
162Google Deep MindGemini 1.5 ProGoogle Deep Mind1,351+/-355,606Google Deep MindProprietary
163Amazonamazon-nova-experimental-chat-10-20Amazon1,350+/-611,479AmazonProprietary
164Tencenthunyuan-turbos-20250226Tencent1,348+/-122,220TencentProprietary
165StepFunAIStep3StepFunAI1,348+/-76,551StepFunAIApache 2.0
166Amazonamazon-nova-experimental-chat-10-09Amazon1,348+/-112,839AmazonProprietary
167OpenAIo3-miniOpenAI1,347+/-457,364OpenAIProprietary
168阿里Qwen3-32B阿里巴巴1,347+/-93,926阿里巴巴Apache 2.0
169Nvidiallama-3.1-nemotron-ultra-253b-v1Nvidia1,347+/-122,549NvidiaNvidia Open Model
170INmercury-2Inception AI1,347+/-113,135Inception AIProprietary
171INling-flash-2.0InclusionAI1,346+/-77,015InclusionAIMIT
172MiniMaxAIMiniMax M2MiniMaxAI1,346+/-86,871MiniMaxAIApache 2.0
173Alibabaqwen-plus-0125Alibaba1,346+/-85,819AlibabaProprietary
174OpenAIGPT-4oOpenAI1,345+/-3112,881OpenAIProprietary
175Nvidianvidia-llama-3.3-nemotron-super-49b-v1.5Nvidia1,343+/-103,346NvidiaNvidia Open
176ZHglm-4-plus-0111Zhipu1,343+/-85,760ZhipuProprietary
177AnthropicClaude 3.5 SonnetAnthropic1,342+/-382,419AnthropicProprietary
178Google Deep MindGemma 3 - 12B (IT)Google Deep Mind1,342+/-103,829Google Deep MindGemma
179Tencenthunyuan-turbo-0110Tencent1,340+/-122,290TencentProprietary
180OpenAIgpt-5-nano-highOpenAI1,337+/-78,274OpenAIProprietary
181亚马Nova 2 Lite亚马逊1,337+/-612,250亚马逊Proprietary
182OpenAIOpenAI o1-miniOpenAI1,337+/-451,981OpenAIProprietary
183阿里QwQ-32B阿里巴巴1,336+/-425,401阿里巴巴Apache 2.0
184xAIGrok 2xAI1,335+/-463,498xAIProprietary
185Googlegemini-advanced-0514Google1,335+/-550,148GoogleProprietary
186OpenAIGPT-4oOpenAI1,334+/-445,499OpenAIProprietary
187Metallama-3.1-405b-instruct-bf16Meta1,334+/-441,375MetaLlama 3.1 Community
188StepFunstep-2-16k-exp-202412StepFun1,334+/-94,833StepFunProprietary
189Metallama-3.1-405b-instruct-fp8Meta1,333+/-459,656MetaLlama 3.1 Community
190AIolmo-3.1-32b-instructAi21,330+/-612,240Ai2Apache 2.0
191AImolmo-2-8bAi21,329+/-21802Ai2Apache 2.0
19201yi-lightning01 AI1,328+/-527,33201 AIProprietary
193Nvidiallama-3.3-nemotron-49b-super-v1Nvidia1,327+/-122,218NvidiaNvidia
194阿里Qwen3-30B-A3B阿里巴巴1,327+/-526,502阿里巴巴Apache 2.0
195Metallama-4-maverick-17b-128e-instructMeta1,327+/-439,996MetaLlama 4
196Tencenthunyuan-large-2025-02-10Tencent1,326+/-103,738TencentProprietary
197OpenAIgpt-4-turbo-2024-04-09OpenAI1,324+/-498,114OpenAIProprietary
198DeepSeekdeepseek-v2.5-1210DeepSeek1,323+/-86,795DeepSeekDeepSeek
199AnthropicClaude 3.5 HaikuAnthropic1,323+/-370,017AnthropicProprietary
200Google Deep MindGemini 1.5 ProGoogle Deep Mind1,323+/-479,138Google Deep MindProprietary
201Metallama-4-scout-17b-16e-instructMeta1,322+/-530,310MetaLlama
202OpenAIgpt-4.1-nano-2025-04-14OpenAI1,322+/-86,103OpenAIProprietary
203AnthropicClaude3-OpusAnthropic1,321+/-3194,909AnthropicProprietary
204INring-flash-2.0InclusionAI1,321+/-77,156InclusionAIMIT
205StepFunstep-1o-turbo-202506StepFun1,320+/-79,039StepFunProprietary
206ZHglm-4-plusZhipu AI1,319+/-526,126Zhipu AIProprietary
207Googlegemma-3n-e4b-itGoogle1,318+/-522,610GoogleGemma
208Metallama-3.3-70b-instructMeta1,318+/-354,748MetaLlama-3.3
209Alibabaqwen-max-0919Alibaba1,318+/-616,478AlibabaQwen
210OpenAIgpt-4o-mini-2024-07-18OpenAI1,317+/-468,710OpenAIProprietary
211OpenAIgpt-oss-20bOpenAI1,317+/-610,637OpenAIApache 2.0
212Nvidianvidia-nemotron-3-nano-30b-a3b-bf16Nvidia1,317+/-615,530NvidiaNVIDIA Open Model
213Alibabaqwen2.5-plus-1127Alibaba1,315+/-610,187AlibabaProprietary
214NEathene-v2-chatNexusFlow1,314+/-524,739NexusFlowNexusFlow
215Mistralmistral-large-2407Mistral1,313+/-445,459MistralMistral Research
216OpenAIgpt-4-0125-previewOpenAI1,312+/-493,439OpenAIProprietary
217OpenAIgpt-4-1106-previewOpenAI1,312+/-4100,105OpenAIProprietary
218Tencenthunyuan-standard-2025-02-10Tencent1,311+/-103,904TencentProprietary
219Googlegemini-1.5-flash-002Google1,309+/-434,902GoogleProprietary
220xAIgrok-2-mini-2024-08-13xAI1,308+/-452,567xAIProprietary
221DeepSeekdeepseek-v2.5DeepSeek1,307+/-524,572DeepSeekDeepSeek
222INmercuryInception AI1,306+/-141,958Inception AIProprietary
223NEathene-70b-0725NexusFlow1,306+/-619,621NexusFlowCC-BY-NC-4.0
224AIolmo-3-32b-thinkAi21,305+/-85,962Ai2Apache 2.0
225Mistralmistral-large-2411Mistral1,305+/-428,073MistralMRL
226Mistralmagistral-medium-2506Mistral1,303+/-611,646MistralProprietary
227Googlegemma-3-4b-itGoogle1,303+/-94,171GoogleGemma
228Mistralmistral-small-3.1-24b-instruct-2503Mistral1,303+/-533,231MistralApache 2.0
229Alibabaqwen2.5-72b-instructAlibaba1,302+/-439,406AlibabaQwen
230Nvidiallama-3.1-nemotron-70b-instructNvidia1,299+/-87,140NvidiaLlama 3.1
231Tencenthunyuan-large-visionTencent1,294+/-95,370TencentProprietary
232Metallama-3.1-70b-instructMeta1,293+/-455,240MetaLlama 3.1 Community
233Amazonamazon-nova-pro-v1.0Amazon1,290+/-524,745AmazonProprietary
234AIjamba-1.5-largeAI21 Labs1,288+/-78,662AI21 LabsJamba Open
235Googlegemma-2-27b-itGoogle1,288+/-375,754GoogleGemma license
236REreka-core-20240904Reka AI1,287+/-77,312Reka AIProprietary
237IBibm-granite-h-smallIBM1,287+/-85,679IBMApache 2.0
238OpenAIgpt-4-0314OpenAI1,286+/-554,173OpenAIProprietary
239AIllama-3.1-tulu-3-70bAi21,286+/-102,846Ai2Llama 3.1
240Nvidiallama-3.1-nemotron-51b-instructNvidia1,285+/-103,749NvidiaLlama 3.1
241Googlegemini-1.5-flash-001Google1,285+/-462,833GoogleProprietary
242AIolmo-3.1-32b-thinkAi21,285+/-78,512Ai2Apache 2.0
243Anthropicclaude-3-sonnet-20240229Anthropic1,280+/-4109,284AnthropicProprietary
244PRgemma-2-9b-it-simpoPrinceton1,279+/-710,072PrincetonMIT
245Nvidianemotron-4-340b-instructNvidia1,276+/-519,659NvidiaNVIDIA Open Model
246Coherecommand-r-plus-08-2024Cohere1,276+/-79,866CohereCC-BY-NC-4.0
247Metallama-3-70b-instructMeta1,275+/-4156,876MetaLlama 3 Community
248OpenAIgpt-4-0613OpenAI1,274+/-488,723OpenAIProprietary
249Mistralmistral-small-24b-instruct-2501Mistral1,274+/-614,681MistralApache 2.0
250ZHglm-4-0520Zhipu AI1,273+/-79,788Zhipu AIProprietary
251REreka-flash-20240904Reka AI1,271+/-77,536Reka AIProprietary
252Alibabaqwen2.5-coder-32b-instructAlibaba1,270+/-85,432AlibabaApache 2.0
253Coherec4ai-aya-expanse-32bCohere1,266+/-527,124CohereCC-BY-NC-4.0
254Googlegemma-2-9b-itGoogle1,265+/-454,611GoogleGemma license
255DeepSeekdeepseek-coder-v2DeepSeek1,264+/-615,147DeepSeekDeepSeek License
256Coherecommand-r-plusCohere1,261+/-477,554CohereCC-BY-NC-4.0
257Alibabaqwen2-72b-instructAlibaba1,261+/-537,325AlibabaQianwen LICENSE
258Anthropicclaude-3-haiku-20240307Anthropic1,260+/-4117,701AnthropicProprietary
259Amazonamazon-nova-lite-v1.0Amazon1,260+/-519,372AmazonProprietary
260Googlegemini-1.5-flash-8b-001Google1,258+/-435,558GoogleProprietary
261Microsoft AzurePhi 4 - 14BMicrosoft Azure1,256+/-524,126Microsoft AzureMIT
262AIolmo-2-0325-32b-instructAi21,251+/-113,334Ai2Apache-2.0
263Coherecommand-r-08-2024Cohere1,249+/-710,140CohereCC-BY-NC-4.0
264Mistralmistral-large-2402Mistral1,241+/-562,436MistralProprietary
265Amazonamazon-nova-micro-v1.0Amazon1,240+/-519,364AmazonProprietary
266AIjamba-1.5-miniAI21 Labs1,239+/-78,858AI21 LabsJamba Open
267Mistralministral-8b-2410Mistral1,237+/-94,781MistralMRL
268Googlegemini-pro-dev-apiGoogle1,235+/-718,354GoogleProprietary
269Alibabaqwen1.5-110b-chatAlibaba1,233+/-626,195AlibabaQianwen LICENSE
270Tencenthunyuan-standard-256kTencent1,233+/-122,728TencentProprietary
271REreka-flash-21b-20240226-onlineReka AI1,232+/-715,450Reka AIProprietary
272Alibabaqwen1.5-72b-chatAlibaba1,232+/-539,302AlibabaQianwen LICENSE
273Mistralmixtral-8x22b-instruct-v0.1Mistral1,228+/-551,416MistralApache 2.0
274Coherecommand-rCohere1,226+/-554,036CohereCC-BY-NC-4.0
275REreka-flash-21b-20240226Reka AI1,226+/-624,806Reka AIProprietary
276OpenAIgpt-3.5-turbo-0125OpenAI1,223+/-566,207OpenAIProprietary
277Metallama-3-8b-instructMeta1,222+/-4104,642MetaLlama 3 Community
278Coherec4ai-aya-expanse-8bCohere1,222+/-79,818CohereCC-BY-NC-4.0
279Mistralmistral-mediumMistral1,222+/-634,550MistralProprietary
280Googlegemini-proGoogle1,221+/-126,390GoogleProprietary
281AIllama-3.1-tulu-3-8bAi21,220+/-112,896Ai2Llama 3.1
28201yi-1.5-34b-chat01 AI1,212+/-524,14601 AIApache-2.0
283HUzephyr-orpo-141b-A35b-v0.1HuggingFace1,212+/-114,652HuggingFaceApache 2.0
284Metallama-3.1-8b-instructMeta1,211+/-449,605MetaLlama 3.1 Community
285IBgranite-3.1-8b-instructIBM1,207+/-113,090IBMApache 2.0
286Alibabaqwen1.5-32b-chatAlibaba1,203+/-621,741AlibabaQianwen LICENSE
287OpenAIgpt-3.5-turbo-1106OpenAI1,202+/-916,619OpenAIProprietary
288Googlegemma-2-2b-itGoogle1,199+/-446,616GoogleGemma license
289Microsoftphi-3-medium-4k-instructMicrosoft1,197+/-525,055MicrosoftMIT
290Mistralmixtral-8x7b-instruct-v0.1Mistral1,196+/-473,503MistralApache 2.0
291DAdbrx-instruct-previewDatabricks1,194+/-632,191DatabricksDBRX LICENSE
292INinternlm2_5-20b-chatInternLM1,191+/-79,901InternLMOther
293Alibabaqwen1.5-14b-chatAlibaba1,190+/-717,839AlibabaQianwen LICENSE
294Microsoftwizardlm-70bMicrosoft1,184+/-98,214MicrosoftLlama 2 Community
295DeepSeekdeepseek-llm-67b-chatDeepSeek1,183+/-124,932DeepSeekDeepSeek License
29601yi-34b-chat01 AI1,183+/-715,48301 AIYi License
297OPopenchat-3.5-0106OpenChat1,181+/-812,637OpenChatApache-2.0
298OPopenchat-3.5OpenChat1,181+/-107,968OpenChatApache-2.0
299IBgranite-3.0-8b-instructIBM1,181+/-96,638IBMApache 2.0
300Googlegemma-1.1-7b-itGoogle1,180+/-623,893GoogleGemma license
301SNsnowflake-arctic-instructSnowflake1,178+/-632,832SnowflakeApache 2.0
302IBgranite-3.1-2b-instructIBM1,178+/-113,188IBMApache 2.0
303ALtulu-2-dpo-70bAllenAI/UW1,177+/-106,535AllenAI/UWAI2 ImpACT Low-risk
304NOopenhermes-2.5-mistral-7bNousResearch1,174+/-105,006NousResearchApache-2.0
305LMvicuna-33bLMSYS1,172+/-622,479LMSYSNon-commercial
306NEstarling-lm-7b-betaNexusflow1,171+/-716,056NexusflowApache-2.0
307Microsoftphi-3-small-8k-instructMicrosoft1,170+/-617,766MicrosoftMIT
308Metallama-2-70b-chatMeta1,170+/-638,492MetaLlama 2 Community
309UCstarling-lm-7b-alphaUC Berkeley1,166+/-810,224UC BerkeleyCC-BY-NC-4.0
310Metallama-3.2-3b-instructMeta1,166+/-87,936MetaLlama 3.2
311NOnous-hermes-2-mixtral-8x7b-dpoNousResearch1,164+/-123,777NousResearchApache-2.0
312Alibabaqwq-32b-previewAlibaba1,156+/-123,231AlibabaApache 2.0
313IBgranite-3.0-2b-instructIBM1,155+/-86,837IBMApache 2.0
314Nvidiallama2-70b-steerlm-chatNvidia1,154+/-133,585NvidiaLlama 2 Community
315UPsolar-10.7b-instruct-v1.0Upstage AI1,151+/-134,155Upstage AICC-BY-NC-4.0
316COdolphin-2.2.1-mistral-7bCognitive Computations1,151+/-151,679Cognitive ComputationsApache-2.0
317MOmpt-30b-chatMosaicML1,149+/-122,572MosaicMLCC-BY-NC-SA-4.0
318Mistralmistral-7b-instruct-v0.2Mistral1,148+/-719,402MistralApache-2.0
319Microsoftwizardlm-13bMicrosoft1,148+/-97,044MicrosoftLlama 2 Community
320TIfalcon-180b-chatTII1,146+/-171,295TIIFalcon-180B TII License
321Alibabaqwen1.5-7b-chatAlibaba1,143+/-104,737AlibabaQianwen LICENSE
322Microsoftphi-3-mini-4k-instruct-june-2024Microsoft1,142+/-612,297MicrosoftMIT
323Metallama-2-13b-chatMeta1,141+/-719,174MetaLlama 2 Community
324LMvicuna-13bLMSYS1,140+/-719,367LMSYSLlama 2 Community
325Alibabaqwen-14b-chatAlibaba1,137+/-114,964AlibabaQianwen LICENSE
326Googlepalm-2Google1,136+/-98,554GoogleProprietary
327Googlegemma-7b-itGoogle1,136+/-108,925GoogleGemma license
328Metacodellama-34b-instructMeta1,136+/-97,366MetaLlama 2 Community
329HUzephyr-7b-betaHuggingFace1,130+/-911,118HuggingFaceMIT
330Microsoftphi-3-mini-128k-instructMicrosoft1,128+/-720,685MicrosoftMIT
331Microsoftphi-3-mini-4k-instructMicrosoft1,127+/-620,118MicrosoftMIT
332UWguanaco-33bUW1,126+/-122,921UWNon-commercial
333HUzephyr-7b-alphaHuggingFace1,126+/-161,785HuggingFaceMIT
334TOstripedhyena-nous-7bTogether AI1,120+/-115,182Together AIApache 2.0
335Metacodellama-70b-instructMeta1,118+/-181,143MetaLlama 2 Community
336Googlegemma-1.1-2b-itGoogle1,114+/-810,854GoogleGemma license
337LMvicuna-7bLMSYS1,114+/-96,923LMSYSLlama 2 Community
338HUsmollm2-1.7b-instructHuggingFace1,113+/-142,199HuggingFaceApache 2.0
339Metallama-3.2-1b-instructMeta1,110+/-88,045MetaLlama 3.2
340Mistralmistral-7b-instructMistral1,109+/-98,977MistralApache 2.0
341Metallama-2-7b-chatMeta1,107+/-714,148MetaLlama 2 Community
342Googlegemma-2b-itGoogle1,091+/-124,780GoogleGemma license
343Alibabaqwen1.5-4b-chatAlibaba1,089+/-97,597AlibabaQianwen LICENSE
344AIolmo-7b-instructAi21,073+/-116,328Ai2Apache-2.0
345UCkoala-13bUC Berkeley1,069+/-106,965UC BerkeleyNon-commercial
346STalpaca-13bStanford1,067+/-115,745StanfordNon-commercial
347NOgpt4all-13b-snoozyNomic AI1,065+/-151,743Nomic AINon-commercial
348MOmpt-7b-chatMosaicML1,061+/-123,924MosaicMLCC-BY-NC-SA-4.0
349TSchatglm3-6bTsinghua1,055+/-124,658TsinghuaApache-2.0
350RWRWKV-4-Raven-14BRWKV1,040+/-114,845RWKVApache 2.0
351TSchatglm2-6bTsinghua1,023+/-142,658TsinghuaApache-2.0
352OPoasst-pythia-12bOpenAssistant1,021+/-116,310OpenAssistantApache 2.0
353TSchatglm-6bTsinghua994+/-134,914TsinghuaNon-commercial
354LMfastchat-t5-3bLMSYS990+/-124,203LMSYSApache 2.0
355DAdolly-v2-12bDatabricks979+/-143,412DatabricksMIT
356Metallama-13bMeta972+/-162,391MetaNon-commercial
357STstablelm-tuned-alpha-7bStability AI952+/-133,287Stability AICC-BY-NC-SA-4.0

数据仅供参考,以官方来源为准。模型名称旁的链接可跳转到 DataLearner 模型详情页。

常见问题 (FAQ)

01

什么是 Text Generation Arena (LMArena)?

Text Generation Arena(原 LMSYS Chatbot Arena)是目前最具影响力的大模型匿名评测平台。用户向两个身份未知的模型提问,根据回答质量投票,系统通过 Elo 算法将数百万次投票汇聚为动态排行榜,被学术界和工业界广泛引用。

02

Arena Elo 分数是如何计算的?

Elo 算法源自国际象棋评分体系。每次对战后,胜者得分上升、败者下降,幅度取决于双方原始评分差距。95% 置信区间(CI)反映该模型参与对战次数的多少:CI 越窄说明数据越充分、排名越可信。

03

为什么同一模型会出现"Thinking"和普通两个版本?

部分模型支持"扩展思考"(Extended Thinking)模式,会在给出最终答案前进行更深入的内部推理。该模式通常在逻辑推理、数学和编程任务上得分更高,但响应时延也更长、成本更高。Arena 将两种模式分开评测,以便用户根据实际需求选择。

04

如何根据排行榜选择适合自己的大语言模型?

建议综合考虑:综合性能(看 Elo 总分)、成本(闭源 API 按量计费,开源可自部署)、中文支持、开源程度以及响应速度。