Text Generation Arena 文本生成模型排行榜
基于 Text Generation Arena 用户匿名投票的最新AI文本生成模型排行榜,涵盖各模型的 Elo 得分、95% 置信区间、投票量、机构与许可证。
榜首模型
Claude Opus 4.7 (max)
最高得分
57
模型数量
201
数据版本
2026年04月23日
数据来源: LM Arena
关于本排行榜
本排行榜展示了当前最强 AI 大模型在文本生成任务中的综合实力排名。数据来源于 LMArena(前身为 LMSYS Chatbot Arena),这是目前全球最大的 AI 模型众包评测平台。用户在平台上与两个匿名模型同时对话,并投票选出更好的回答——排名完全由真实用户的偏好决定,而非实验室基准测试。
评测方法概要
匿名盲测:用户同时与两个"隐藏身份"的模型对话,根据回答质量投票,排除品牌偏见。
Elo 评分:基于国际象棋领域的 Elo Rating 体系(Bradley-Terry 模型),通过对战结果计算每个模型的实力分数。分数越高,说明模型在真实对话中被用户选中的概率越大。
场景覆盖广泛:涵盖编程、创意写作、数学推理、知识问答、角色扮演等高频真实场景。
DataLearner 在原始数据基础上提供中文解读与深度分析,并将排行榜模型关联至 DataLearner 模型库,方便您一键查看模型详情、API 定价、评测得分等完整信息。
筛选条件
榜单历史快照月份:
排名总表
| 排名 | 模型名称 | 得分 | 95% CI | 投票数 | 机构 | 许可证 |
|---|---|---|---|---|---|---|
| 1 | Claude Opus 4.7 (max) | 57 | / | / | Anthropic | / |
| 2 | Gemini 3.1 Pro Preview | 57 | / | / | Google Deep Mind | / |
| 3 | GPT-5.4 | 57 | / | / | OpenAI | / |
| 4 | Kimi K2.6 | 54 | / | / | Kimi | / |
| 5 | MiMo-V2.5-Pro | 54 | / | / | Xiaomi | / |
| 6 | GPT-5.3 Codex | 54 | / | / | OpenAI | / |
| 7 | Claude Opus 4.6 | 53 | / | / | Anthropic | / |
| 8 | Muse Spark | 52 | / | / | Facebook AI研究实验室 | / |
| 9 | Claude Opus 4.7 (Non-reasoning, high) | 52 | / | / | Anthropic | / |
| 10 | Qwen3.6 Max Preview | 52 | / | / | Alibaba | / |
| 11 | Claude Sonnet 4.6 | 52 | / | / | Anthropic | / |
| 12 | GLM 5.1 | 51 | / | / | 智谱AI | / |
| 13 | Qwen 3.6 Plus Preview | 50 | / | / | 阿里巴巴 | / |
| 14 | GLM-5 | 50 | / | / | 智谱AI | / |
| 15 | MiniMax-M2.7 | 50 | / | / | MiniMax | / |
| 16 | Grok 4.20 0309 v2 | 49 | / | / | xAI | / |
| 17 | MiMo-V2-Pro | 49 | / | / | Xiaomi | / |
| 18 | GPT-5.4 mini | 49 | / | / | OpenAI | / |
| 19 | Kimi K2.5 | 47 | / | / | Moonshot AI | / |
| 20 | GLM-5-Turbo | 47 | / | / | 智谱AI | / |
| 21 | Claude Opus 4.6 (high) | 46 | / | / | Anthropic | / |
| 22 | Gemini 3.0 Flash | 46 | / | / | Google Deep Mind | / |
| 23 | Qwen3.5-397B-A17B | 45 | / | / | 阿里巴巴 | / |
| 24 | Nova 2 Omni(Preview) | 45 | / | / | 亚马逊 | / |
| 25 | Claude Sonnet 4.6 | 44 | / | / | Anthropic | / |
| 26 | GPT-5.4 nano | 44 | / | / | OpenAI | / |
| 27 | GLM 5.1 | 44 | / | / | 智谱AI | / |
| 28 | Qwen3.6 35B A3B | 43 | / | / | Alibaba | / |
| 29 | MiMo-V2-Omni | 43 | / | / | Xiaomi | / |
| 30 | GLM-5V-Turbo | 43 | / | / | 智谱AI | / |
| 31 | Claude Sonnet 4.6 | 43 | / | / | Anthropic | / |
| 32 | Qwen3.5-27B | 42 | / | / | 阿里巴巴 | / |
| 33 | DeepSeek V3.2 | 42 | / | / | DeepSeek-AI | / |
| 34 | Qwen3.5-122B-A10B | 42 | / | / | 阿里巴巴 | / |
| 35 | Gemini 2.0 Flash Experimental | 41 | / | / | DeepMind | / |
| 36 | Gemini 3.1 Pro Preview | 41 | / | / | Google Deep Mind | / |
| 37 | GLM-5 | 41 | / | / | 智谱AI | / |
| 38 | Qwen3.5-397B-A17B | 40 | / | / | 阿里巴巴 | / |
| 39 | Qwen3-Max-Thinking | 40 | / | / | 阿里巴巴 | / |
| 40 | Gemma 4 31B | 39 | / | / | / | |
| 41 | Qwen3.5-Omni-Plus | 39 | / | / | 阿里巴巴 | / |
| 42 | Grok 4.1 Fast | 39 | / | / | xAI | / |
| 43 | Step 3.5 Flash | 38 | / | / | StepFunAI | / |
| 44 | OpenAI o3 | 38 | / | / | OpenAI | / |
| 45 | GPT-5.4 nano | 38 | / | / | OpenAI | / |
| 46 | Step 3.5 Flash | 38 | / | / | StepFunAI | / |
| 47 | GPT-5.4 mini | 38 | / | / | OpenAI | / |
| 48 | Kimi K2.5 | 37 | / | / | Moonshot AI | / |
| 49 | Qwen3.5-27B | 37 | / | / | 阿里巴巴 | / |
| 50 | Qwen3.5-35B-A3B | 37 | / | / | 阿里巴巴 | / |
| 51 | Haiku 4.5 | 37 | / | / | Anthropic | / |
| 52 | NVIDIA Nemotron 3 Super | 36 | / | / | NVIDIA | / |
| 53 | Qwen3.5-122B-A10B | 36 | / | / | 阿里巴巴 | / |
| 54 | Nova 2 Pro(Preview) | 36 | / | / | 亚马逊 | / |
| 55 | GPT-5.4 (Non-reasoning) | 35 | / | / | OpenAI | / |
| 56 | Gemini 3.0 Flash | 35 | / | / | Google Deep Mind | / |
| 57 | Gemini 2.5-Pro | 35 | / | / | Google Deep Mind | / |
| 58 | Nova 2 Lite | 35 | / | / | 亚马逊 | / |
| 59 | Gemini 3.1 Flash-Lite Preview | 34 | / | / | / | |
| 60 | Doubao Seed Code | 34 | / | / | ByteDance Seed | / |
| 61 | GPT OSS 120B | 33 | / | / | OpenAI | / |
| 62 | Mercury 2 | 33 | / | / | Inception | / |
| 63 | Qwen3.5-9B-Instruct | 32 | / | / | 阿里巴巴 | / |
| 64 | Gemma 4 31B | 32 | / | / | / | |
| 65 | K-EXAONE | 32 | / | / | LG AI Research | / |
| 66 | DeepSeek V3.2 | 32 | / | / | DeepSeek-AI | / |
| 67 | Grok-3 mini - Reasoning | 32 | / | / | xAI | / |
| 68 | Nova 2 Pro(Preview) | 32 | / | / | 亚马逊 | / |
| 69 | Trinity Large Thinking | 32 | / | / | Arcee AI | / |
| 70 | Qwen3.6 35B A3B | 32 | / | / | Alibaba | / |
| 71 | Gemma 4 26B A4B | 31 | / | / | / | |
| 72 | Haiku 4.5 | 31 | / | / | Anthropic | / |
| 73 | Qwen3.5-35B-A3B | 31 | / | / | 阿里巴巴 | / |
| 74 | MiMo-V2-Flash | 30 | / | / | Xiaomi | / |
| 75 | Nova 2 Lite | 30 | / | / | 亚马逊 | / |
| 76 | DeepSeek V3.2 Speciale | 29 | / | / | DeepSeek-AI | / |
| 77 | ERNIE 5.0 | 29 | / | / | 百度 | / |
| 78 | Grok 4.20 0309 v2 | 29 | / | / | xAI | / |
| 79 | Grok Code Fast 1 | 29 | / | / | xAI | / |
| 80 | Nemotron Cascade 2 30B A3B | 28 | / | / | NVIDIA | / |
| 81 | Qwen3-Coder-Next | 28 | / | / | 阿里巴巴 | / |
| 82 | Nova 2 Omni(Preview) | 28 | / | / | 亚马逊 | / |
| 83 | Mistral Small 4 | 28 | / | / | Mistral | / |
| 84 | Qwen3.5-9B-Instruct | 27 | / | / | 阿里巴巴 | / |
| 85 | Magistral Medium 1.2 | 27 | / | / | Mistral | / |
| 86 | Gemma 4 26B A4B | 27 | / | / | / | |
| 87 | Qwen3.5 4B | 27 | / | / | Alibaba | / |
| 88 | DeepSeek-R1-0528 | 27 | / | / | DeepSeek-AI | / |
| 89 | Qwen3-Next | 27 | / | / | 阿里巴巴 | / |
| 90 | Ling 2.6 Flash | 26 | / | / | InclusionAI | / |
| 91 | Solar Pro 3 | 26 | / | / | Upstage | / |
| 92 | Qwen3.5-Omni-Flash | 26 | / | / | 阿里巴巴 | / |
| 93 | JT-MINI | 25 | / | / | China Mobile | / |
| 94 | Qwen3-Coder-480B-A35B | 25 | / | / | 阿里巴巴 | / |
| 95 | Nova 2 Lite | 25 | / | / | 亚马逊 | / |
| 96 | GPT OSS 20B | 24 | / | / | OpenAI | / |
| 97 | GPT OSS 120B | 24 | / | / | OpenAI | / |
| 98 | GPT-5.4 nano | 24 | / | / | OpenAI | / |
| 99 | NVIDIA Nemotron 3 Nano | 24 | / | / | NVIDIA | / |
| 100 | LongCat Flash Lite | 24 | / | / | LongCat | / |
| 101 | Grok 4.1 Fast | 24 | / | / | xAI | / |
| 102 | K-EXAONE | 23 | / | / | LG AI Research | / |
| 103 | GPT-5.4 mini | 23 | / | / | OpenAI | / |
| 104 | Nova 2 Omni(Preview) | 23 | / | / | 亚马逊 | / |
| 105 | Nova 2 Pro(Preview) | 23 | / | / | 亚马逊 | / |
| 106 | Mi:dm K 2.5 Pro | 23 | / | / | Korea Telecom | / |
| 107 | Mistral Large 3 | 23 | / | / | MistralAI | / |
| 108 | Ring-1T | 23 | / | / | InclusionAI | / |
| 109 | Qwen3.5 4B | 23 | / | / | Alibaba | / |
| 110 | INTELLECT-3 | 22 | / | / | Prime Intellect | / |
| 111 | Devstral 2 | 22 | / | / | Mistral | / |
| 112 | Solar Open 100B | 22 | / | / | Upstage | / |
| 113 | Gemini 2.5 Flash-Lite-Preview-09-2025 | 22 | / | / | Google Deep Mind | / |
| 114 | Mistral Medium 3.1 | 21 | / | / | Mistral | / |
| 115 | GPT OSS 20B | 21 | / | / | OpenAI | / |
| 116 | Qwen3-Next | 20 | / | / | 阿里巴巴 | / |
| 117 | Devstral Small 2 | 19 | / | / | Mistral | / |
| 118 | Gemini 2.5 Flash-Lite-Preview-09-2025 | 19 | / | / | Google Deep Mind | / |
| 119 | Motif-2-12.7B | 19 | / | / | Motif Technologies | / |
| 120 | Ling-1T | 19 | / | / | InclusionAI | / |
| 121 | Nova Premier | 19 | / | / | Amazon | / |
| 122 | Gemma 4 E4B | 19 | / | / | / | |
| 123 | Llama Nemotron Super 49B v1.5 | 19 | / | / | NVIDIA | / |
| 124 | Mistral Small 4 | 19 | / | / | Mistral | / |
| 125 | Llama 3.3 Nemotron Super 49B | 18 | / | / | NVIDIA | / |
| 126 | Llama 4 Maverick | 18 | / | / | Facebook AI研究实验室 | / |
| 127 | Sarvam 105B (high) | 18 | / | / | Sarvam | / |
| 128 | Magistral Small 1.2 | 18 | / | / | Mistral | / |
| 129 | Nova 2 Lite | 18 | / | / | 亚马逊 | / |
| 130 | Llama3.1-405B | 17 | / | / | Facebook AI研究实验室 | / |
| 131 | EXAONE 4.0 32B | 17 | / | / | LG AI Research | / |
| 132 | Nova 2 Omni(Preview) | 17 | / | / | 亚马逊 | / |
| 133 | DeepSeek-R1-0528-Qwen3-8B | 16 | / | / | DeepSeek-AI | / |
| 134 | Qwen3.5 2B | 16 | / | / | Alibaba | / |
| 135 | Nanbeige4.1-3B | 16 | / | / | Nanbeige | / |
| 136 | Ministral 3 14B | 16 | / | / | MistralAI | / |
| 137 | DeepSeek-R1-Distill-Llama-70B | 16 | / | / | DeepSeek-AI | / |
| 138 | Falcon-H1R-7B | 16 | / | / | TII UAE | / |
| 139 | Ling-flash-2.0 | 16 | / | / | InclusionAI | / |
| 140 | Qwen3-Omni-30B-A3B | 16 | / | / | 阿里巴巴 | / |
| 141 | Step3 VL 10B | 15 | / | / | StepFun | / |
| 142 | Gemma 4 E2B | 15 | / | / | / | |
| 143 | Llama Nemotron Ultra | 15 | / | / | NVIDIA | / |
| 144 | ERNIE-4.5-300B-A47B | 15 | / | / | 百度 | / |
| 145 | Solar Pro 2 | 15 | / | / | Upstage | / |
| 146 | NVIDIA Nemotron Nano 12B v2 VL | 15 | / | / | NVIDIA | / |
| 147 | Ministral 3 8B | 15 | / | / | MistralAI | / |
| 148 | Gemma 4 E4B | 15 | / | / | / | |
| 149 | NVIDIA Nemotron Nano 9B V2 | 15 | / | / | NVIDIA | / |
| 150 | NVIDIA Nemotron 3 Nano 4B | 15 | / | / | NVIDIA | / |
| 151 | Qwen3.5 2B | 15 | / | / | Alibaba | / |
| 152 | Llama Nemotron Super 49B v1.5 | 15 | / | / | NVIDIA | / |
| 153 | Llama3.3-70B-Instruct | 14 | / | / | Facebook AI研究实验室 | / |
| 154 | Llama 3.1 Nemotron Nano 4B v1.1 | 14 | / | / | NVIDIA | / |
| 155 | Kimi Linear 48B A3B Instruct | 14 | / | / | Kimi | / |
| 156 | Llama 3.3 Nemotron Super 49B | 14 | / | / | NVIDIA | / |
| 157 | Ring-flash-2.0 | 14 | / | / | InclusionAI | / |
| 158 | Solar Pro 2 | 14 | / | / | Upstage | / |
| 159 | Llama 4 Scout | 14 | / | / | Facebook AI研究实验室 | / |
| 160 | C4AI Command A (202503) | 13 | / | / | CohereAI | / |
| 161 | Llama 3.1 Nemotron 70B | 13 | / | / | NVIDIA | / |
| 162 | NVIDIA Nemotron 3 Nano | 13 | / | / | NVIDIA | / |
| 163 | NVIDIA Nemotron Nano 9B V2 | 13 | / | / | NVIDIA | / |
| 164 | Sarvam 30B (high) | 12 | / | / | Sarvam | / |
| 165 | Gemma 4 E2B | 12 | / | / | / | |
| 166 | R1 1776 | 12 | / | / | Perplexity | / |
| 167 | Llama 3.2-Vision-90B | 12 | / | / | Facebook AI研究实验室 | / |
| 168 | EXAONE 4.0 32B | 12 | / | / | LG AI Research | / |
| 169 | Ministral 3 3B | 11 | / | / | Mistral | / |
| 170 | Jamba 1.7 Large | 11 | / | / | AI21 Labs | / |
| 171 | Granite 4.0 H Small | 11 | / | / | IBM | / |
| 172 | Qwen3-Omni-30B-A3B | 11 | / | / | 阿里巴巴 | / |
| 173 | Qwen3.5 0.8B | 11 | / | / | Alibaba | / |
| 174 | LFM2 24B A2B | 10 | / | / | Liquid AI | / |
| 175 | Phi 4 - 14B | 10 | / | / | Microsoft Azure | / |
| 176 | Amazon Nova Micro | 10 | / | / | 亚马逊 | / |
| 177 | NVIDIA Nemotron Nano 12B v2 VL | 10 | / | / | NVIDIA | / |
| 178 | Phi-4-multimodal-instruct | 10 | / | / | Microsoft Azure | / |
| 179 | Qwen3.5 0.8B | 10 | / | / | Alibaba | / |
| 180 | Jamba Reasoning 3B | 10 | / | / | AI21 Labs | / |
| 181 | Gemini 3.0 Flash | 10 | / | / | Google Deep Mind | / |
| 182 | Ling-mini-2.0 | 9 | / | / | InclusionAI | / |
| 183 | Llama 3.2-Vision-11B | 9 | / | / | Facebook AI研究实验室 | / |
| 184 | Phi-4-mini-instruct (3.8B) | 8 | / | / | Microsoft Azure | / |
| 185 | Exaone 4.0 1.2B | 8 | / | / | LG AI Research | / |
| 186 | Exaone 4.0 1.2B | 8 | / | / | LG AI Research | / |
| 187 | LFM2.5-1.2B-Thinking | 8 | / | / | Liquid AI | / |
| 188 | Jamba 1.7 Mini | 8 | / | / | AI21 Labs | / |
| 189 | LFM2.5-1.2B-Instruct | 8 | / | / | Liquid AI | / |
| 190 | LFM2 2.6B | 8 | / | / | Liquid AI | / |
| 191 | Granite 4.0 H 1B | 8 | / | / | IBM | / |
| 192 | Gemma 3-270M | 8 | / | / | Google Deep Mind | / |
| 193 | Apertus 70B Instruct | 8 | / | / | Swiss AI Initiative | / |
| 194 | Granite 4.0 Micro | 8 | / | / | IBM | / |
| 195 | Granite 4.0 1B | 7 | / | / | IBM | / |
| 196 | LFM2 8B A1B | 7 | / | / | Liquid AI | / |
| 197 | LFM2.5-VL-1.6B | 6 | / | / | Liquid AI | / |
| 198 | Granite 4.0 350M | 6 | / | / | IBM | / |
| 199 | Apertus 8B Instruct | 6 | / | / | Swiss AI Initiative | / |
| 200 | Granite 4.0 H 350M | 5 | / | / | IBM | / |
| 201 | Tiny Aya Global | 5 | / | / | Cohere | / |
数据仅供参考,以官方来源为准。模型名称旁的链接可跳转到 DataLearner 模型详情页。