DataLearner 标志DataLearnerAI
最新AI资讯
大模型评测
大模型列表
大模型对比
资源中心
Tools
语言中文

加载中...

DataLearner 标志DataLearner AI

专注大模型评测、数据资源与实践教学的知识平台,持续更新可落地的 AI 能力图谱。

产品

  • 评测榜单
  • 模型对比
  • 数据资源

资源

  • 部署教程
  • 原创内容
  • 工具导航

关于

  • 关于我们
  • 隐私政策
  • 数据收集方法
  • 联系我们

© 2026 DataLearner AI. DataLearner 持续整合行业数据与案例,为科研、企业与开发者提供可靠的大模型情报与实践指南。

隐私政策服务条款
首页评测总览Text Generation Arena 文本生成模型排行榜

LMArena 评测赛道

文本生成图像编辑文字生成视频图生视频文生图

Text Generation Arena 文本生成模型排行榜

基于 Text Generation Arena 用户匿名投票的最新AI文本生成模型排行榜,涵盖各模型的 Elo 得分、95% 置信区间、投票量、机构与许可证。

榜首模型

claude-opus-4-6-thinking

最高得分

1,502

模型数量

100

数据版本

2026年03月20日

数据来源: LM Arena

关于本排行榜

本排行榜展示了当前最强 AI 大模型在文本生成任务中的综合实力排名。数据来源于 LMArena(前身为 LMSYS Chatbot Arena),这是目前全球最大的 AI 模型众包评测平台。用户在平台上与两个匿名模型同时对话,并投票选出更好的回答——排名完全由真实用户的偏好决定,而非实验室基准测试。

评测方法概要

匿名盲测:用户同时与两个"隐藏身份"的模型对话,根据回答质量投票,排除品牌偏见。

Elo 评分:基于国际象棋领域的 Elo Rating 体系(Bradley-Terry 模型),通过对战结果计算每个模型的实力分数。分数越高,说明模型在真实对话中被用户选中的概率越大。

场景覆盖广泛:涵盖编程、创意写作、数学推理、知识问答、角色扮演等高频真实场景。

DataLearner 在原始数据基础上提供中文解读与深度分析,并将排行榜模型关联至 DataLearner 模型库,方便您一键查看模型详情、API 定价、评测得分等完整信息。

文本生成 Elo 分数排名

Top 10

图表来源:DataLearnerAI · 数据来源:LMArena

排名总表

排名模型名称得分95% CI投票数机构许可证
1claude-opus-4-6-thinking1,502+/-611,801AnthropicProprietary
2claude-opus-4-61,501+/-612,546AnthropicProprietary
3gemini-3.1-pro-preview1,493+/-614,677GoogleProprietary
4grok-4.20-beta11,492+/-77,396xAIProprietary
5gemini-3-pro1,486+/-441,762GoogleProprietary
6gpt-5.4-high1,485+/-94,965OpenAIProprietary
7gpt-5.2-chat-latest-202602101,482+/-610,140OpenAIProprietary
8grok-4.20-beta-0309-reasoning1,481+/-94,504xAIProprietary
9gemini-3-flash1,475+/-431,060GoogleProprietary
10claude-opus-4-5-20251101-thinking-32k1,474+/-437,036AnthropicProprietary
11grok-4.1-thinking1,472+/-443,930xAIProprietary
12claude-opus-4-5-202511011,469+/-441,976AnthropicProprietary
13claude-sonnet-4-61,465+/-69,843AnthropicProprietary
14qwen3.5-max-preview1,464+/-94,252AlibabaProprietary
15gpt-5.3-chat-latest1,464+/-78,942OpenAIProprietary
16gemini-3-flash (thinking-minimal)1,463+/-427,448GoogleProprietary
17gpt-5.41,463+/-84,972OpenAIProprietary
18dola-seed-2.0-preview1,462+/-610,651BytedanceProprietary
19grok-4.11,461+/-447,757xAIProprietary
20gpt-5.1-high1,455+/-440,759OpenAIProprietary
21glm-51,455+/-611,093Z.aiMIT
22kimi-k2.5-thinking1,453+/-516,262MoonshotModified MIT
23claude-sonnet-4-5-202509291,453+/-353,556AnthropicProprietary
24claude-sonnet-4-5-20250929-thinking-32k1,453+/-355,811AnthropicProprietary
25ernie-5.0-01101,452+/-518,715BaiduProprietary
26qwen3.5-397b-a17b1,452+/-610,431AlibabaApache 2.0
27ernie-5.0-preview-12031,450+/-79,857BaiduProprietary
28claude-opus-4-1-20250805-thinking-16k1,449+/-350,375AnthropicProprietary
29gemini-2.5-pro1,448+/-3103,317GoogleProprietary
30claude-opus-4-1-202508051,447+/-378,224AnthropicProprietary
31mimo-v2-pro1,445+/-103,531XiaomiProprietary
32gpt-4.5-preview-2025-02-271,444+/-614,547OpenAIProprietary
33chatgpt-4o-latest-202503261,443+/-383,559OpenAIProprietary
34glm-4.71,443+/-612,242Z.aiMIT
35gpt-5.2-high1,442+/-525,328OpenAIProprietary
36gpt-5.21,440+/-522,231OpenAIProprietary
37gpt-5.11,439+/-443,475OpenAIProprietary
38gemini-3.1-flash-lite-preview1,438+/-93,881GoogleProprietary
39qwen3-max-preview1,435+/-428,066AlibabaProprietary
40gpt-5-high1,434+/-532,470OpenAIProprietary
41kimi-k2.5-instant1,433+/-78,257MoonshotModified MIT
42o3-2025-04-161,432+/-460,698OpenAIProprietary
43grok-4-1-fast-reasoning1,431+/-437,473xAIProprietary
44kimi-k2-thinking-turbo1,430+/-441,738MoonshotModified MIT
45amazon-nova-exp-chat-26-02-101,429+/-103,467AmazonProprietary
46gpt-5-chat1,426+/-432,009OpenAIProprietary
47glm-4.61,426+/-436,102Z.aiMIT
48deepseek-v3.2-exp-thinking1,425+/-79,188DeepSeekMIT
49deepseek-v3.21,425+/-436,511DeepSeekMIT
50qwen3-max-2025-09-231,424+/-69,273AlibabaProprietary
51claude-opus-4-20250514-thinking1,424+/-437,503AnthropicProprietary
52deepseek-v3.2-exp1,423+/-612,088DeepSeekMIT
53qwen3-235b-a22b-instruct1,422+/-377,683AlibabaApache 2.0
54deepseek-v3.2-thinking1,422+/-431,048DeepSeekMIT
55deepseek-r1-05281,421+/-618,831DeepSeekMIT
56grok-4-fast-chat1,421+/-86,901xAIProprietary
57ernie-5.0-preview-10221,419+/-94,782BaiduProprietary
58deepseek-v3.11,418+/-615,150DeepSeekMIT
59kimi-k2-0905-preview1,418+/-611,924MoonshotModified MIT
60qwen3.5-122b-a10b1,417+/-76,946AlibabaApache 2.0
61kimi-k2-0711-preview1,417+/-528,082MoonshotModified MIT
62deepseek-v3.1-thinking1,417+/-711,885DeepSeekMIT
63deepseek-v3.1-terminus-think1,416+/-103,497DeepSeekMIT
64mistral-large-31,416+/-433,200MistralApache 2.0
65deepseek-v3.1-terminus1,416+/-103,736DeepSeekMIT
66qwen3-vl-235b-a22b-instruct1,415+/-611,645AlibabaApache 2.0
67amazon-nova-exp-chat-26-01-101,414+/-103,439AmazonProprietary
68gpt-4.1-2025-04-141,413+/-451,831OpenAIProprietary
69claude-opus-4-202505141,413+/-444,988AnthropicProprietary
70grok-3-preview-02-241,412+/-433,374xAIProprietary
71gemini-2.5-flash1,411+/-3102,736GoogleProprietary
72glm-4.51,411+/-524,640Z.aiMIT
73grok-4-07091,410+/-442,034xAIProprietary
74mistral-medium-25081,410+/-372,410MistralProprietary
75minimax-m2.71,407+/-112,981MiniMaxProprietary
76claude-haiku-4-5-202510011,407+/-354,261AnthropicProprietary
77qwen3.5-27b1,406+/-76,957AlibabaApache 2.0
78minimax-m2.51,405+/-611,909MiniMaxModified MIT
79gemini-2.5-flash-preview1,405+/-433,278GoogleProprietary
80grok-4-fast-reasoning1,405+/-518,993xAIProprietary
81qwen3-235b-a22b-no-thinking1,403+/-438,797AlibabaApache 2.0
82o1-2024-12-171,402+/-427,807OpenAIProprietary
83qwen3-next-80b-a3b-instruct1,401+/-523,187AlibabaApache 2.0
84qwen3.5-flash1,401+/-77,853AlibabaProprietary
85qwen3.5-35b-a3b1,401+/-77,278AlibabaApache 2.0
86longcat-flash-chat1,400+/-611,517MeituanMIT
87qwen3-235b-a22b-thinking1,399+/-79,128AlibabaApache 2.0
88claude-sonnet-4-thinking1,399+/-435,733AnthropicProprietary
89deepseek-r11,398+/-518,524DeepSeekMIT
90hunyuan-vision-1.5-thinking1,396+/-122,235TencentProprietary
91qwen3-vl-235b-a22b-thinking1,396+/-78,052AlibabaApache 2.0
92amazon-nova-exp-chat-12-101,396+/-103,720AmazonProprietary
93deepseek-v3-03241,394+/-446,144DeepSeekMIT
94mai-1-preview1,393+/-518,095Microsoft AIProprietary
95mimo-v2-flash (non-thinking)1,392+/-425,427XiaomiMIT
96o4-mini-2025-04-161,390+/-446,166OpenAIProprietary
97gpt-5-mini-high1,390+/-527,372OpenAIProprietary
98claude-sonnet-4-202505141,389+/-441,021AnthropicProprietary
99step-3.5-flash1,389+/-613,885StepFunApache 2.0
100o1-preview1,388+/-531,122OpenAIProprietary

数据仅供参考,以官方来源为准。模型名称旁的链接可跳转到 DataLearner 模型详情页。