DataLearner 标志DataLearnerAI
最新AI资讯
大模型排行榜
大模型评测基准
大模型列表
大模型对比
资源中心
工具
语言中文
DataLearner 标志DataLearner AI

专注大模型评测、数据资源与实践教学的知识平台,持续更新可落地的 AI 能力图谱。

产品

  • 评测榜单
  • 模型对比
  • 数据资源

资源

  • 部署教程
  • 原创内容
  • 工具导航

关于

  • 关于我们
  • 隐私政策
  • 数据收集方法
  • 联系我们

© 2026 DataLearner AI. DataLearner 持续整合行业数据与案例,为科研、企业与开发者提供可靠的大模型情报与实践指南。

隐私政策服务条款
首页综合排行榜Text Generation Arena 文本生成模型排行榜

LMArena 评测赛道

文本生成图像编辑文字生成视频图生视频文生图

Text Generation Arena 文本生成模型排行榜

基于 Text Generation Arena 用户匿名投票的最新AI文本生成模型排行榜,涵盖各模型的 Elo 得分、95% 置信区间、投票量、机构与许可证。

榜首模型

Claude Opus 4.6

最高得分

1,502

模型数量

150

数据版本

2026年04月14日

数据来源: LM Arena

关于本排行榜

本排行榜展示了当前最强 AI 大模型在文本生成任务中的综合实力排名。数据来源于 LMArena(前身为 LMSYS Chatbot Arena),这是目前全球最大的 AI 模型众包评测平台。用户在平台上与两个匿名模型同时对话,并投票选出更好的回答——排名完全由真实用户的偏好决定,而非实验室基准测试。

评测方法概要

匿名盲测:用户同时与两个"隐藏身份"的模型对话,根据回答质量投票,排除品牌偏见。

Elo 评分:基于国际象棋领域的 Elo Rating 体系(Bradley-Terry 模型),通过对战结果计算每个模型的实力分数。分数越高,说明模型在真实对话中被用户选中的概率越大。

场景覆盖广泛:涵盖编程、创意写作、数学推理、知识问答、角色扮演等高频真实场景。

DataLearner 在原始数据基础上提供中文解读与深度分析,并将排行榜模型关联至 DataLearner 模型库,方便您一键查看模型详情、API 定价、评测得分等完整信息。

筛选条件

榜单历史快照月份:

排名总表

排名模型名称得分95% CI投票数机构许可证
1Claude Opus 4.61,502/17,219Anthropic/
2Claude Opus 4.61,496/18,377Anthropic/
3Muse Spark1,495/4,182Facebook AI研究实验室/
4Gemini 3.1 Pro Preview1,493/21,708Google Deep Mind/
5Gemini 3.0 Pro (Preview 11-2025)1,486/41,578Google Deep Mind/
6grok-4.20-beta11,485/10,884xAI/
7gpt-5.4-high1,481/10,633OpenAI/
8grok-4.20-beta-0309-reasoning1,479/10,713xAI/
9gpt-5.2-chat-latest-202602101,476/16,810OpenAI/
10grok-4.20-multi-agent-beta-03091,476/11,079xAI/
11Gemini 3.0 Flash1,474/30,922Google Deep Mind/
12Claude Opus 41,473/37,292Anthropic/
13GLM 5.11,471/6,274智谱AI/
14Grok 4.1 Thinking1,470/48,508xAI/
15Claude Opus 41,469/48,318Anthropic/
16gpt-5.41,466/10,990OpenAI/
17qwen3.5-max-preview1,466/8,774Alibaba/
18Gemini 3.0 Flash1,462/34,519Google Deep Mind/
19claude-sonnet-4-61,461/10,935Anthropic/
20dola-seed-2.0-pro1,460/19,770Bytedance/
21Grok 4.11,460/52,460xAI/
22gpt-5.4-mini-high1,457/8,174OpenAI/
23GLM-51,456/14,988智谱AI/
24gpt-5.3-chat-latest1,455/15,448OpenAI/
25GPT-5.1 Pro1,454/41,035OpenAI/
26claude-sonnet-4-5-20250929-thinking-32k1,452/61,159Anthropic/
27claude-sonnet-4-5-202509291,451/59,047Anthropic/
28Kimi K2 Thinking1,451/21,678Moonshot AI/
29gemma-4-31b1,450/5,839Google/
30ERNIE 5.01,450/23,507百度/
31ERNIE 5.01,449/9,808百度/
32claude-opus-4-1-20250805-thinking-16k1,448/50,147Anthropic/
33Gemini 2.5 Pro Experimental 03-251,448/108,717Google Deep Mind/
34mimo-v2-pro1,447/9,239Xiaomi/
35claude-opus-4-1-202508051,447/77,831Anthropic/
36Qwen3.5-397B-A17B1,446/16,360阿里巴巴/
37GPT-4.51,444/14,547OpenAI/
38chatgpt-4o-latest-202503261,443/82,981OpenAI/
39GLM-4.71,443/12,180智谱AI/
40GPT-5.2 Pro1,441/31,439OpenAI/
41GPT-5.21,439/28,519OpenAI/
42gemma-4-26b-a4b1,439/5,795Google/
43GPT-5.1 Instant1,438/43,688OpenAI/
44longcat-flash-chat-2602-exp1,437/6,751Meituan/
45gemini-3.1-flash-lite-preview1,436/16,969Google/
46Qwen3 Max (Preview)1,435/27,926阿里巴巴/
47GPT-5-Pro1,433/32,239OpenAI/
48kimi-k2.5-instant1,432/8,234Moonshot/
49grok-4-1-fast-reasoning1,432/43,555xAI/
50OpenAI o31,431/60,167OpenAI/
51kimi-k2-thinking-turbo1,430/47,037Moonshot/
52amazon-nova-experimental-chat-26-02-101,428/3,448Amazon/
53GPT-51,426/31,842OpenAI/
54GLM-4.61,426/35,904智谱AI/
55DeepSeek V3.2-Exp1,425/9,140DeepSeek-AI/
56qwen3-max-2025-09-231,424/9,244Alibaba/
57claude-opus-4-20250514-thinking-16k1,424/37,185Anthropic/
58DeepSeek V3.21,423/42,036DeepSeek-AI/
59Qwen3-235B-A22B-25071,423/82,850阿里巴巴/
60DeepSeek V3.21,423/36,441DeepSeek-AI/
61DeepSeek V3.2-Exp1,423/12,013DeepSeek-AI/
62DeepSeek-R1-05281,422/18,590DeepSeek-AI/
63Grok 4 Fast1,421/6,864xAI/
64ERNIE 5.01,419/4,762百度/
65qwen3.5-122b-a10b1,418/13,066Alibaba/
66kimi-k2-0905-preview1,418/11,862Moonshot/
67DeepSeek-V3.11,418/15,068DeepSeek-AI/
68Kimi K21,417/27,861Moonshot AI/
69DeepSeek-V3.11,417/11,825DeepSeek-AI/
70deepseek-v3.1-terminus-thinking1,416/3,488DeepSeek/
71DeepSeek-V3.1 Terminus1,416/3,722DeepSeek-AI/
72Qwen3-VL-235B-A22B-Instruct1,416/11,608阿里巴巴/
73Mistral Large 31,415/39,232MistralAI/
74amazon-nova-experimental-chat-26-01-101,415/3,432Amazon/
75gpt-4.1-2025-04-141,413/51,399OpenAI/
76Claude Opus 41,412/44,550Anthropic/
77Grok 31,412/33,045xAI/
78Gemini 2.5 Flash1,411/108,193Google Deep Mind/
79GLM-4.51,411/24,507智谱AI/
80grok-4-07091,410/41,734xAI/
81Magistral-Medium-25061,410/78,272MistralAI/
82claude-haiku-4-5-202510011,408/60,452Anthropic/
83gemini-2.5-flash-preview-09-20251,405/33,128Google/
84grok-4-fast-reasoning1,404/18,875xAI/
85qwen3-235b-a22b-no-thinking1,403/38,470Alibaba/
86minimax-m2.71,402/7,635MiniMax/
87gpt-5.4-nano-high1,402/7,479OpenAI/
88MiniMax M2.51,402/18,224MiniMaxAI/
89qwen3.5-27b1,402/12,770Alibaba/
90o1-2024-12-171,401/27,807OpenAI/
91qwen3-next-80b-a3b-instruct1,401/23,060Alibaba/
92longcat-flash-chat1,401/11,476Meituan/
93qwen3-235b-a22b-thinking-25071,400/9,061Alibaba/
94qwen3.5-flash1,399/13,598Alibaba/
95claude-sonnet-4-20250514-thinking-32k1,398/35,416Anthropic/
96DeepSeek-R11,397/18,524DeepSeek-AI/
97hunyuan-vision-1.5-thinking1,397/2,225Tencent/
98qwen3.5-35b-a3b1,396/13,268Alibaba/
99qwen3-vl-235b-a22b-thinking1,396/8,021Alibaba/
100amazon-nova-experimental-chat-12-101,395/3,707Amazon/
101DeepSeek-V3-03241,394/45,799DeepSeek-AI/
102mai-1-preview1,393/18,015Microsoft AI/
103mimo-v2-flash (non-thinking)1,391/31,132Xiaomi/
104Step 3.5 Flash1,391/19,379StepFunAI/
105o4-mini-2025-04-161,390/45,738OpenAI/
106gpt-5-mini-high1,389/27,246OpenAI/
107claude-sonnet-4-202505141,389/40,649Anthropic/
108o1-preview1,388/31,122OpenAI/
109qwen3-coder-480b-a35b-instruct1,387/25,958Alibaba/
110hunyuan-t1-202507111,387/4,736Tencent/
111mimo-v2-flash (thinking)1,387/11,021Xiaomi/
112claude-3-7-sonnet-20250219-thinking-32k1,386/38,995Anthropic/
113mistral-medium-25051,386/33,442Mistral/
114minimax-m2.1-preview1,385/17,234MiniMax/
115qwen3-30b-a3b-instruct-25071,383/23,932Alibaba/
116hunyuan-turbos-202504161,383/10,775Tencent/
117gpt-4.1-mini-2025-04-141,382/39,548OpenAI/
118gemini-2.5-flash-lite-preview-09-2025-no-thinking1,380/47,541Google/
119GLM-4.6V1,378/2,817智谱AI/
120trinity-large-preview1,374/14,164Arcee AI/
121gemini-2.5-flash-lite-preview-06-17-thinking1,374/33,170Google/
122qwen3-235b-a22b1,374/26,423Alibaba/
123qwen2.5-max1,374/32,709Alibaba/
124glm-4.5-air1,373/31,372Z.ai/
125claude-3-5-sonnet-202410221,372/88,515Anthropic/
126claude-3-7-sonnet-202502191,370/43,394Anthropic/
127qwen3-next-80b-a3b-thinking1,369/13,836Alibaba/
128glm-4.7-flash1,368/11,819Z.ai/
129amazon-nova-experimental-chat-11-101,367/25,539Amazon/
130gemma-3-27b-it1,365/47,842Google/
131minimax-m11,363/35,505MiniMax/
132o3-mini-high1,363/18,589OpenAI/
133grok-3-mini-high1,363/17,078xAI/
134nvidia-nemotron-3-super-120b-a12b1,361/7,435Nvidia/
135gemini-2.0-flash-0011,360/43,911Google/
136deepseek-v31,358/21,770DeepSeek/
137grok-3-mini-beta1,357/22,879xAI/
138mistral-small-25061,357/17,850Mistral/
139intellect-31,356/5,357Prime Intellect/
140gpt-oss-120b1,354/30,883OpenAI/
141command-a-03-20251,353/56,663Cohere/
142glm-4.5v1,353/4,983Z.ai/
143gemini-2.0-flash-lite-preview-02-051,353/24,955Google/
144gemini-1.5-pro-0021,351/55,606Google/
145amazon-nova-experimental-chat-10-201,350/11,535Amazon/
146hunyuan-turbos-202502261,348/2,220Tencent/
147step-31,348/6,585StepFun/
148o3-mini1,347/57,563OpenAI/
149qwen3-32b1,347/3,926Alibaba/
150llama-3.1-nemotron-ultra-253b-v11,347/2,549Nvidia/

数据仅供参考,以官方来源为准。模型名称旁的链接可跳转到 DataLearner 模型详情页。