DataLearner 标志DataLearnerAI
最新AI资讯
大模型排行榜
大模型评测基准
大模型列表
大模型对比
资源中心
工具
语言中文
DataLearner 标志DataLearner AI

专注大模型评测、数据资源与实践教学的知识平台,持续更新可落地的 AI 能力图谱。

产品

  • 评测榜单
  • 模型对比
  • 数据资源

资源

  • 部署教程
  • 原创内容
  • 工具导航

关于

  • 关于我们
  • 隐私政策
  • 数据收集方法
  • 联系我们

© 2026 DataLearner AI. DataLearner 持续整合行业数据与案例,为科研、企业与开发者提供可靠的大模型情报与实践指南。

隐私政策服务条款
首页综合排行榜Text Generation Arena 文本生成模型排行榜

LMArena 评测赛道

文本生成代码数学图像编辑文字生成视频图生视频文生图

Text Generation Arena 文本生成模型排行榜

基于 Text Generation Arena 用户匿名投票的最新AI文本生成模型排行榜,涵盖各模型的 Elo 得分、95% 置信区间、投票量、机构与许可证。

榜首模型

Opus 4.7 (thinking)

最高得分

1,503

模型数量

200

数据版本

2026年04月24日

数据来源: LM Arena

关于本排行榜

本排行榜展示了当前最强 AI 大模型在文本生成任务中的综合实力排名。数据来源于 LMArena(前身为 LMSYS Chatbot Arena),这是目前全球最大的 AI 模型众包评测平台。用户在平台上与两个匿名模型同时对话,并投票选出更好的回答——排名完全由真实用户的偏好决定,而非实验室基准测试。

评测方法概要

匿名盲测:用户同时与两个"隐藏身份"的模型对话,根据回答质量投票,排除品牌偏见。

Elo 评分:基于国际象棋领域的 Elo Rating 体系(Bradley-Terry 模型),通过对战结果计算每个模型的实力分数。分数越高,说明模型在真实对话中被用户选中的概率越大。

场景覆盖广泛:涵盖编程、创意写作、数学推理、知识问答、角色扮演等高频真实场景。

DataLearner 在原始数据基础上提供中文解读与深度分析,并将排行榜模型关联至 DataLearner 模型库,方便您一键查看模型详情、API 定价、评测得分等完整信息。

筛选条件

榜单历史快照月份:

排名总表

排名模型名称得分95% CI投票数机构许可证
OPOpus 4.7 (thinking)1,503+/-85,321AnthropicProprietary
CLClaude Opus 4.6 (thinking)1,503+/-520,192AnthropicProprietary
CLClaude Opus 4.61,496+/-521,537AnthropicProprietary
4OPOpus 4.71,494+/-86,017AnthropicProprietary
5GEGemini 3.1 Pro Preview1,493+/-525,353Google Deep MindProprietary
6MUMuse Spark1,492+/-77,213Facebook AI研究实验室Proprietary
7GEGemini 3.0 Pro (Preview 11-2025)1,486+/-441,383Google Deep MindProprietary
8GRgrok-4.20-beta11,482+/-614,620xAIProprietary
9GPgpt-5.4-high1,481+/-613,593OpenAIProprietary
10GRgrok-4.20-beta-0309-reasoning1,479+/-613,841xAIProprietary
11GPgpt-5.2-chat-latest-202602101,476+/-519,964OpenAIProprietary
12GRgrok-4.20-multi-agent-beta-03091,476+/-514,223xAIProprietary
13GEGemini 3.0 Flash1,474+/-430,791Google Deep MindProprietary
14CLClaude Opus 4 (thinking-32k)1,473+/-437,164AnthropicProprietary
15GLGLM 5.11,470+/-79,028智谱AIMIT
16GRGrok 4.1 Thinking1,469+/-451,489xAIProprietary
17CLClaude Opus 41,469+/-451,295AnthropicProprietary
18GPgpt-5.41,467+/-614,170OpenAIProprietary
19QWqwen3.5-max-preview1,466+/-611,416AlibabaProprietary
20DEdeepseek-v4-pro1,463+/-94,163DeepSeekMIT
21CLClaude Sonnet 4.61,463+/-613,434AnthropicProprietary
22GEGemini 3.0 Flash (thinking-minimal)1,463+/-437,596Google Deep MindProprietary
23DEdeepseek-v4-pro-thinking1,462+/-93,783DeepSeekMIT
24GRGrok 4.11,461+/-455,513xAIProprietary
25DOdola-seed-2.0-pro1,460+/-522,902BytedanceProprietary
26KIkimi-k2.61,458+/-94,355MoonshotModified MIT
27GPgpt-5.4-mini-high1,457+/-611,237OpenAIProprietary
28GLGLM-51,457+/-517,733智谱AIMIT
29GPGPT-5.1 Pro1,455+/-440,871OpenAIProprietary
30CLClaude Sonnet 4.5 (thinking-32k)1,453+/-363,725AnthropicProprietary
31CLClaude Sonnet 4.51,452+/-361,608AnthropicProprietary
32GEgemma-4-31b1,451+/-85,818GoogleApache 2.0
33ERERNIE 5.01,450+/-425,815百度Proprietary
34GPgpt-5.3-chat-latest1,450+/-518,707OpenAIProprietary
35ERERNIE 5.01,449+/-79,767百度Proprietary
36KIKimi K2 Thinking1,449+/-424,213Moonshot AIModified MIT
37OPOpus 4.1 (thinking-16k)1,449+/-349,864AnthropicProprietary
38GEGemini 2.5 Pro Experimental 03-251,448+/-3111,209Google Deep MindProprietary
39MImimo-v2-pro1,448+/-612,059XiaomiProprietary
40OPOpus 4.11,447+/-377,453AnthropicProprietary
41QWqwen3.6-plus1,447+/-85,480AlibabaProprietary
42QWQwen3.5-397B-A17B1,446+/-519,254阿里巴巴Apache 2.0
43GPGPT-4.51,444+/-614,547OpenAIProprietary
44CHchatgpt-4o-latest-202503261,443+/-382,567OpenAIProprietary
45GLGLM-4.71,443+/-612,138智谱AIMIT
46GPGPT-5.2 Pro1,440+/-434,410OpenAIProprietary
47DEdeepseek-v4-flash-thinking1,439+/-93,607DeepSeekMIT
48GEgemini-3.1-flash-lite-preview1,439+/-520,088GoogleProprietary
49GEgemma-4-26b-a4b1,439+/-85,778GoogleApache 2.0
50GPGPT-5.1 Instant1,439+/-443,516OpenAIProprietary
51GPGPT-5.21,439+/-431,577OpenAIProprietary
52QWQwen3 Max (Preview)1,435+/-527,769阿里巴巴Proprietary
53LOlongcat-flash-chat-2602-exp1,434+/-69,660MeituanProprietary
54GPGPT-5-Pro1,433+/-531,986OpenAIProprietary
55DEdeepseek-v4-flash1,433+/-93,497DeepSeekMIT
56KIkimi-k2.5-instant1,432+/-78,205MoonshotModified MIT
57GRgrok-4-1-fast-reasoning1,432+/-446,539xAIProprietary
58OPOpenAI o31,431+/-459,795OpenAIProprietary
59KIkimi-k2-thinking-turbo1,430+/-349,631MoonshotModified MIT
60AMamazon-nova-experimental-chat-26-02-101,428+/-103,421AmazonProprietary
61GPGPT-51,426+/-431,630OpenAIProprietary
62GLGLM-4.61,426+/-435,711智谱AIMIT
63DEDeepSeek V3.2-Exp (thinking)1,425+/-79,078DeepSeek-AIMIT
64DEDeepSeek V3.21,424+/-444,738DeepSeek-AIMIT
65QWqwen3-max-2025-09-231,424+/-69,181AlibabaProprietary
66CLClaude Opus 4 (thinking-16k)1,424+/-436,951AnthropicProprietary
67DEDeepSeek V3.2-Exp1,423+/-611,949DeepSeek-AIMIT
68QWQwen3-235B-A22B-25071,423+/-385,263阿里巴巴Apache 2.0
69DEDeepSeek V3.2 (thinking)1,422+/-438,982DeepSeek-AIMIT
70DEDeepSeek-R1-05281,422+/-618,474DeepSeek-AIMIT
71GRGrok 4 Fast1,421+/-86,826xAIProprietary
72ERERNIE 5.01,419+/-94,732百度Proprietary
73QWqwen3.5-122b-a10b1,418+/-516,040AlibabaApache 2.0
74KIkimi-k2-0905-preview1,418+/-711,800MoonshotModified MIT
75DEDeepSeek-V3.11,418+/-614,992DeepSeek-AIMIT
76KIKimi K21,417+/-527,668Moonshot AIModified MIT
77DEdeepseek-v3.1-terminus-thinking1,417+/-103,469DeepSeekMIT
78DEDeepSeek-V3.1 (thinking)1,417+/-711,759DeepSeek-AIMIT
79DEDeepSeek-V3.1 Terminus1,416+/-103,713DeepSeek-AIMIT
80AMamazon-nova-experimental-chat-26-01-101,416+/-103,420AmazonProprietary
81QWQwen3-VL-235B-A22B-Instruct1,416+/-611,528阿里巴巴Apache 2.0
82MIMistral Large 31,415+/-441,361MistralAIApache 2.0
83GPgpt-4.1-2025-04-141,413+/-451,065OpenAIProprietary
84CLClaude Opus 41,412+/-444,273AnthropicProprietary
85GRGrok 31,412+/-432,920xAIProprietary
86GLGLM-4.51,411+/-524,352智谱AIMIT
87GEGemini 2.5 Flash1,411+/-3110,831Google Deep MindProprietary
88GRgrok-4-07091,410+/-441,428xAIProprietary
89MAMagistral-Medium-25061,409+/-380,898MistralAIProprietary
90CLclaude-haiku-4-5-202510011,408+/-363,329AnthropicProprietary
91QWqwen3.5-27b1,405+/-515,682AlibabaApache 2.0
92GEgemini-2.5-flash-preview-09-20251,405+/-432,953GoogleProprietary
93GPgpt-5.4-nano-high1,405+/-610,607OpenAIProprietary
94GRgrok-4-fast-reasoning1,404+/-518,745xAIProprietary
95MIMiniMax-M2.71,403+/-610,307MiniMaxAIModified MIT
96QWqwen3-235b-a22b-no-thinking1,403+/-438,253AlibabaApache 2.0
97O1o1-2024-12-171,402+/-427,807OpenAIProprietary
98QWqwen3-next-80b-a3b-instruct1,402+/-522,907AlibabaApache 2.0
99LOlongcat-flash-chat1,401+/-611,417MeituanMIT
100MIMiniMax M2.51,399+/-521,236MiniMaxAIModified MIT
101QWqwen3-235b-a22b-thinking-25071,399+/-79,008AlibabaApache 2.0
102CLClaude Sonnet 4 (thinking-32k)1,399+/-435,153AnthropicProprietary
103DEDeepSeek-R11,398+/-518,524DeepSeek-AIMIT
104QWqwen3.5-flash1,397+/-516,394AlibabaProprietary
105HUhunyuan-vision-1.5-thinking1,396+/-122,218TencentProprietary
106QWqwen3-vl-235b-a22b-thinking1,396+/-77,947AlibabaApache 2.0
107QWqwen3.5-35b-a3b1,395+/-516,250AlibabaApache 2.0
108AMamazon-nova-experimental-chat-12-101,395+/-103,687AmazonProprietary
109DEDeepSeek-V3-03241,395+/-445,546DeepSeek-AIMIT
110MImimo-v2-flash (non-thinking)1,393+/-433,906XiaomiMIT
111MAmai-1-preview1,392+/-517,901Microsoft AIProprietary
112STStep 3.5 Flash1,392+/-521,987StepFunAIApache 2.0
113GPgpt-5-mini-high1,390+/-527,067OpenAIProprietary
114O4o4-mini-2025-04-161,390+/-445,482OpenAIProprietary
115CLClaude Sonnet 41,389+/-440,359AnthropicProprietary
116O1o1-preview1,388+/-531,122OpenAIProprietary
117HUhunyuan-t1-202507111,387+/-94,714TencentProprietary
118MImimo-v2-flash (thinking)1,387+/-610,980XiaomiMIT
119QWqwen3-coder-480b-a35b-instruct1,387+/-525,762AlibabaApache 2.0
120CLClaude Sonnet 3.7 (thinking-32k)1,387+/-438,841AnthropicProprietary
121MImistral-medium-25051,387+/-533,260MistralProprietary
122MIminimax-m2.1-preview1,385+/-517,150MiniMaxMIT
123QWqwen3-30b-a3b-instruct-25071,383+/-523,768AlibabaApache 2.0
124HUhunyuan-turbos-202504161,382+/-610,723TencentProprietary
125GPgpt-4.1-mini-2025-04-141,382+/-439,371OpenAIProprietary
126GEgemini-2.5-flash-lite-preview-09-2025-no-thinking1,380+/-347,291GoogleProprietary
127GLGLM-4.6V1,378+/-112,806智谱AIMIT
128QWqwen3-235b-a22b1,375+/-526,285AlibabaApache 2.0
129TRtrinity-large-preview1,375+/-517,170Arcee AIApache 2.0
130GEgemini-2.5-flash-lite-preview-06-17-thinking1,374+/-532,957GoogleProprietary
131QWqwen2.5-max1,374+/-432,629AlibabaProprietary
132GLglm-4.5-air1,373+/-431,136Z.aiMIT
133CLclaude-3-5-sonnet-202410221,372+/-388,367AnthropicProprietary
134CLClaude Sonnet 3.71,371+/-443,216AnthropicProprietary
135QWqwen3-next-80b-a3b-thinking1,369+/-613,718AlibabaApache 2.0
136GLglm-4.7-flash1,368+/-611,769Z.aiMIT
137AMamazon-nova-experimental-chat-11-101,367+/-425,430AmazonProprietary
138GEgemma-3-27b-it1,366+/-447,577GoogleGemma
139MIminimax-m11,363+/-435,252MiniMaxApache 2.0
140O3o3-mini-high1,363+/-518,589OpenAIProprietary
141GRgrok-3-mini-high1,362+/-516,979xAIProprietary
142NVnvidia-nemotron-3-super-120b-a12b1,361+/-77,408NvidiaNVIDIA Open Model
143GEgemini-2.0-flash-0011,360+/-443,771GoogleProprietary
144DEdeepseek-v31,358+/-521,770DeepSeekDeepSeek
145MImistral-small-25061,357+/-517,722MistralApache 2.0
146GRgrok-3-mini-beta1,357+/-522,722xAIProprietary
147INintellect-31,356+/-85,332Prime IntellectMIT
148COcommand-a-03-20251,354+/-356,346CohereCC-BY-NC-4.0
149GPgpt-oss-120b1,353+/-430,674OpenAIApache 2.0
150GLglm-4.5v1,353+/-84,968Z.aiMIT
151GEgemini-2.0-flash-lite-preview-02-051,353+/-424,955GoogleProprietary
152GEgemini-1.5-pro-0021,351+/-355,606GoogleProprietary
153AMamazon-nova-experimental-chat-10-201,350+/-611,486AmazonProprietary
154HUhunyuan-turbos-202502261,349+/-122,220TencentProprietary
155STstep-31,348+/-76,560StepFunApache 2.0
156O3o3-mini1,348+/-457,375OpenAIProprietary
157AMamazon-nova-experimental-chat-10-091,347+/-112,839AmazonProprietary
158QWqwen3-32b1,347+/-93,926AlibabaApache 2.0
159LLllama-3.1-nemotron-ultra-253b-v11,347+/-122,549NvidiaNvidia Open Model
160MEmercury-21,347+/-113,124Inception AIProprietary
161MIminimax-m21,346+/-86,876MiniMaxApache 2.0
162QWqwen-plus-01251,346+/-85,819AlibabaProprietary
163LIling-flash-2.01,346+/-77,018InclusionAIMIT
164GPgpt-4o-2024-05-131,345+/-3112,881OpenAIProprietary
165GLglm-4-plus-01111,343+/-85,760ZhipuProprietary
166NVnvidia-llama-3.3-nemotron-super-49b-v1.51,343+/-103,347NvidiaNvidia Open
167CLclaude-3-5-sonnet-202406201,342+/-382,419AnthropicProprietary
168GEgemma-3-12b-it1,342+/-103,829GoogleGemma
169HUhunyuan-turbo-01101,340+/-122,290TencentProprietary
170NOnova-2-lite1,337+/-612,251AmazonProprietary
171GPgpt-5-nano-high1,337+/-78,281OpenAIProprietary
172O1o1-mini1,337+/-451,981OpenAIProprietary
173QWqwq-32b1,336+/-425,406AlibabaApache 2.0
174GRgrok-2-2024-08-131,335+/-463,498xAIProprietary
175GEgemini-advanced-05141,335+/-550,148GoogleProprietary
176GPgpt-4o-2024-08-061,335+/-445,499OpenAIProprietary
177LLllama-3.1-405b-instruct-bf161,335+/-441,375MetaLlama 3.1 Community
178STstep-2-16k-exp-2024121,334+/-94,833StepFunProprietary
179LLllama-3.1-405b-instruct-fp81,333+/-459,656MetaLlama 3.1 Community
180OLolmo-3.1-32b-instruct1,331+/-612,238Ai2Apache 2.0
181YIyi-lightning1,328+/-527,33201 AIProprietary
182MOmolmo-2-8b1,328+/-21803Ai2Apache 2.0
183LLllama-3.3-nemotron-49b-super-v11,327+/-122,218NvidiaNvidia
184QWqwen3-30b-a3b1,327+/-526,507AlibabaApache 2.0
185LLllama-4-maverick-17b-128e-instruct1,327+/-440,010MetaLlama 4
186HUhunyuan-large-2025-02-101,326+/-103,738TencentProprietary
187GPgpt-4-turbo-2024-04-091,324+/-498,114OpenAIProprietary
188DEdeepseek-v2.5-12101,323+/-86,795DeepSeekDeepSeek
189GEgemini-1.5-pro-0011,323+/-479,138GoogleProprietary
190CLclaude-3-5-haiku-202410221,323+/-370,029AnthropicProprietary
191LLllama-4-scout-17b-16e-instruct1,322+/-530,319MetaLlama
192GPgpt-4.1-nano-2025-04-141,322+/-86,103OpenAIProprietary
193CLClaude3-Opus1,321+/-3194,909AnthropicProprietary
194RIring-flash-2.01,321+/-77,158InclusionAIMIT
195STstep-1o-turbo-2025061,320+/-79,044StepFunProprietary
196GLglm-4-plus1,319+/-526,126Zhipu AIProprietary
197LLllama-3.3-70b-instruct1,318+/-354,756MetaLlama-3.3
198GEgemma-3n-e4b-it1,318+/-522,616GoogleGemma
199QWqwen-max-09191,318+/-616,478AlibabaQwen
200GPgpt-oss-20b1,317+/-610,640OpenAIApache 2.0

数据仅供参考,以官方来源为准。模型名称旁的链接可跳转到 DataLearner 模型详情页。

常见问题 (FAQ)

什么是 Text Generation Arena (LMArena)?▼
Text Generation Arena(原 LMSYS Chatbot Arena)是目前最具影响力的大模型匿名评测平台。用户向两个身份未知的模型提问,根据回答质量投票,系统通过 Elo 算法将数百万次投票汇聚为动态排行榜,被学术界和工业界广泛引用。
Arena Elo 分数是如何计算的?▼
Elo 算法源自国际象棋评分体系。每次对战后,胜者得分上升、败者下降,幅度取决于双方原始评分差距——击败强模型加分更多,输给弱模型扣分也更多。95% 置信区间(CI)反映该模型参与对战次数的多少:CI 越窄说明数据越充分、排名越可信。
为什么同一模型会出现"Thinking"和普通两个版本?▼
部分模型支持"扩展思考"(Extended Thinking)模式,会在给出最终答案前进行更深入的内部推理。该模式通常在逻辑推理、数学和编程任务上得分更高,但响应时延也更长、成本更高。Arena 将两种模式分开评测,以便用户根据实际需求选择。
如何根据排行榜选择适合自己的大语言模型?▼
建议综合考虑:综合性能(看 Elo 总分)、成本(闭源 API 按量计费,开源可自部署)、中文支持(国产模型如 Qwen、GLM、DeepSeek 在中文场景更占优)、开源程度(MIT/Apache 协议可商用)以及响应速度(Flash/mini 轻量版时延更低)。