Text Generation Arena Leaderboard

The latest AI text generation leaderboard based on LMArena anonymous user voting. Covers Elo scores, confidence intervals, and vote counts for leading language models.

Top Model

Claude Fable 5

Top Score

1,508

Model Count

367

Data version

2026年06月16日

Data source: LM Arena

About This Leaderboard

This leaderboard ranks the strongest AI models for text generation. Data comes from LMArena (formerly LMSYS Chatbot Arena), the world's largest crowdsourced AI evaluation platform. Users chat with two anonymous models side-by-side and vote for the better response — rankings are determined entirely by real user preferences, not lab benchmarks.

Methodology Overview

Blind testing: Users chat with two anonymous models and vote based on response quality, eliminating brand bias.

Elo scoring: Using the Bradley-Terry model (adapted from chess Elo ratings) to calculate each model's strength score from battle outcomes. Higher scores mean users more frequently prefer that model.

Broad scenario coverage: Testing spans coding, creative writing, math reasoning, Q&A, role-playing, and more.

DataLearner provides in-depth analysis on top of the raw data, linking leaderboard models to the DataLearner model database so you can quickly access model details, API pricing, benchmark scores, and more.

Origin:AllChina
Leaderboard snapshot month:

Ranking Table

RankModelScore95% CIVotesOrganizationLicense
AnthropicClaude Fable 5Anthropic1,508+/-94,297AnthropicProprietary
AnthropicClaude Opus 4.6 (thinking)Anthropic1,504+/-446,410AnthropicProprietary
AnthropicOpus 4.7 (thinking)Anthropic1,502+/-532,629AnthropicProprietary
4AnthropicClaude Opus 4.6Anthropic1,499+/-449,596AnthropicProprietary
5AnthropicOpus 4.7Anthropic1,493+/-533,793AnthropicProprietary
6Muse SparkFacebook AI研究实验室1,487+/-613,607Facebook AI研究实验室Proprietary
7Google Deep MindGemini 3.1 Pro PreviewGoogle Deep Mind1,486+/-460,640Google Deep MindProprietary
8Google Deep MindGemini 3.0 Pro (Preview 11-2025)Google Deep Mind1,486+/-441,314Google Deep MindProprietary
9AnthropicClaude Opus 4.8 (thinking)Anthropic1,483+/-612,963AnthropicProprietary
10OpenAIGPT-5.5 (high)OpenAI1,481+/-528,268OpenAIProprietary
11OpenAIGPT-5.4 (high)OpenAI1,478+/-440,959OpenAIProprietary
12AnthropicClaude Opus 4.8Anthropic1,478+/-613,316AnthropicProprietary
13Google Deep MindGemini 3.5 FlashGoogle Deep Mind1,476+/-710,171Google Deep MindProprietary
14OpenAIGPT-5.2OpenAI1,475+/-434,555OpenAIProprietary
15GLM 5.1智谱AI1,475+/-616,101智谱AIMIT
16OpenAIGPT-5.5OpenAI1,475+/-529,071OpenAIProprietary
17Qwen3.7-Max-Preview阿里巴巴1,475+/-103,740阿里巴巴Proprietary
18xAIgrok-4.20-beta-0309-reasoningxAI1,474+/-442,370xAIProprietary
19xAIGrok 4.20 BetaxAI1,474+/-526,964xAIProprietary
20Google Deep MindGemini 3.0 FlashGoogle Deep Mind1,473+/-430,711Google Deep MindProprietary
21AnthropicClaude Opus 4 (thinking-32k)Anthropic1,473+/-437,087AnthropicProprietary
22OpenAIGPT-5.5 InstantOpenAI1,473+/-526,254OpenAIProprietary
23xAIgrok-4.20-multi-agent-beta-0309xAI1,472+/-441,384xAIProprietary
24AnthropicClaude Sonnet 4.6Anthropic1,472+/-439,561AnthropicProprietary
25GLM-5.2 (max)智谱AI1,471+/-103,357智谱AIMIT
26AnthropicClaude Opus 4Anthropic1,469+/-371,167AnthropicProprietary
27OpenAIGPT-5.4OpenAI1,468+/-443,382OpenAIProprietary
28ERNIE-5.1-Preview百度1,468+/-525,064百度Proprietary
29mimo-v2.5-proXiaomi1,466+/-526,563XiaomiMIT
30xAIGrok 4.1 ThinkingxAI1,466+/-365,623xAIProprietary
31Qwen3.5 Max Preview阿里巴巴1,465+/-521,564阿里巴巴Proprietary
32Qwen3.6-Max-Preview阿里巴巴1,461+/-85,216阿里巴巴Proprietary
33Google Deep MindGemini 3.0 Flash (minimal)Google Deep Mind1,460+/-366,402Google Deep MindProprietary
34Moonshot AIKimi K2.6Moonshot AI1,460+/-525,456Moonshot AIModified MIT
35xAIGrok 4.1xAI1,460+/-367,759xAIProprietary
36DeepSeek-AIDeepSeek-V4-Pro (thinking)DeepSeek-AI1,458+/-526,928DeepSeek-AIMIT
37GLM-5智谱AI1,457+/-523,246智谱AIMIT
38DeepSeek-AIDeepSeek-V4-ProDeepSeek-AI1,456+/-528,720DeepSeek-AIMIT
39AnthropicClaude Sonnet 4.5 (thinking-32k)Anthropic1,455+/-382,494AnthropicProprietary
40AnthropicClaude Sonnet 4.5Anthropic1,455+/-380,950AnthropicProprietary
41DOLA Seed 2.0 Pro字节跳动Seed团队1,455+/-450,401字节跳动Seed团队Proprietary
42OpenAIGPT-5.1 Pro (high)OpenAI1,455+/-440,820OpenAIProprietary
43DeepMindGemma 4 31BDeepMind1,451+/-85,884DeepMindApache 2.0
44Moonshot AIKimi K2 ThinkingMoonshot AI1,450+/-447,780Moonshot AIModified MIT
45ERNIE 5.0百度1,449+/-79,748百度Proprietary
46AnthropicOpus 4.1 (thinking-16k)Anthropic1,449+/-349,802AnthropicProprietary
47OpenAIGPT-5.3OpenAI1,449+/-433,125OpenAIProprietary
48mimo-v2-proXiaomi1,448+/-524,606XiaomiProprietary
49MiniMaxminimax-m3MiniMax1,448+/-711,264MiniMaxProprietary
50OpenAIGPT-5.4 mini (high)OpenAI1,448+/-439,525OpenAIProprietary
51AnthropicOpus 4.1Anthropic1,447+/-377,333AnthropicProprietary
52ERNIE 5.0百度1,447+/-435,299百度Proprietary
53Google Deep MindGemini 2.5 Pro Experimental 03-25Google Deep Mind1,446+/-3124,588Google Deep MindProprietary
54OpenAIGPT-4.5OpenAI1,445+/-614,547OpenAIProprietary
55Qwen 3.6 Plus Preview阿里巴巴1,444+/-528,997阿里巴巴Proprietary
56xAIGrok 4.3 BetaxAI1,444+/-528,229xAIProprietary
57Qwen3.5-397B-A17B阿里巴巴1,444+/-443,048阿里巴巴Apache 2.0
58OpenAIGPT-4o(2025-03-27)OpenAI1,443+/-382,447OpenAIProprietary
59GLM-4.7智谱AI1,443+/-612,121智谱AIMIT
60OpenAIGPT-5.1 InstantOpenAI1,439+/-443,457OpenAIProprietary
61DeepMindGemma 4 26B A4BDeepMind1,438+/-85,813DeepMindApache 2.0
62OpenAIGPT-5.2 Pro (high)OpenAI1,438+/-448,063OpenAIProprietary
63DeepSeek-AIDeepSeek-V4-Flash (thinking)DeepSeek-AI1,436+/-528,215DeepSeek-AIMIT
64Meituanlongcat-flash-chat-2602-expMeituan1,436+/-528,187MeituanProprietary
65Qwen3 Max (Preview)阿里巴巴1,435+/-527,716阿里巴巴Proprietary
66OpenAIGPT-5.2OpenAI1,435+/-359,625OpenAIProprietary
67DeepSeek-AIDeepSeek-V4-FlashDeepSeek-AI1,434+/-528,291DeepSeek-AIMIT
68OpenAIGPT-5-Pro (high)OpenAI1,434+/-531,928OpenAIProprietary
69mimo-v2.5Xiaomi1,433+/-527,111XiaomiMIT
70Googlegemini-3.1-flash-lite-previewGoogle1,432+/-448,525GoogleProprietary
71mimo-v2-omniXiaomi1,432+/-612,528XiaomiProprietary
72Moonshot AIKimi K2.5 InstantMoonshot AI1,431+/-78,177Moonshot AIModified MIT
73OpenAIOpenAI o3OpenAI1,431+/-459,744OpenAIProprietary
74xAIGrok 4.1 Fast (fast-reasoning)xAI1,431+/-356,873xAIProprietary
75Moonshot AIKimi K2 Thinking (thinking-turbo)Moonshot AI1,430+/-362,098Moonshot AIModified MIT
76Amazonamazon-nova-experimental-chat-26-02-10Amazon1,427+/-103,417AmazonProprietary
77OpenAIGPT-5OpenAI1,427+/-431,569OpenAIProprietary
78Mistralmistral-medium-3.5Mistral1,426+/-710,739MistralModified MIT
79GLM-4.6智谱AI1,425+/-435,640智谱AIMIT
80DeepSeek-AIDeepSeek V3.2DeepSeek-AI1,425+/-447,303DeepSeek-AIMIT
81DeepSeek-AIDeepSeek V3.2-Exp (thinking)DeepSeek-AI1,425+/-79,069DeepSeek-AIMIT
82AnthropicClaude Opus 4 (thinking-16k)Anthropic1,424+/-436,887AnthropicProprietary
83Alibabaqwen3-max-2025-09-23Alibaba1,424+/-69,151AlibabaProprietary
84Qwen3-235B-A22B-2507阿里巴巴1,423+/-397,241阿里巴巴Apache 2.0
85DeepSeek-AIDeepSeek V3.2-ExpDeepSeek-AI1,423+/-611,922DeepSeek-AIMIT
86DeepSeek-AIDeepSeek V3.2 (thinking)DeepSeek-AI1,423+/-441,085DeepSeek-AIMIT
87DeepSeek-AIDeepSeek-R1-0528DeepSeek-AI1,422+/-618,463DeepSeek-AIMIT
88xAIGrok 4 FastxAI1,421+/-86,809xAIProprietary
89ERNIE 5.0百度1,419+/-94,705百度Proprietary
90Moonshot AIKimi K2 0905Moonshot AI1,418+/-711,780Moonshot AIModified MIT
91DeepSeek-AIDeepSeek-V3.1 Terminus (thinking)DeepSeek-AI1,418+/-103,462DeepSeek-AIMIT
92Moonshot AIKimi K2Moonshot AI1,417+/-527,637Moonshot AIModified MIT
93DeepSeek-AIDeepSeek-V3.1DeepSeek-AI1,417+/-614,958DeepSeek-AIMIT
94Qwen3.5-122B-A10B阿里巴巴1,417+/-428,575阿里巴巴Apache 2.0
95DeepSeek-AIDeepSeek-V3.1 (thinking)DeepSeek-AI1,417+/-711,737DeepSeek-AIMIT
96MiniMaxAIMiniMax-M2.7MiniMaxAI1,417+/-434,620MiniMaxAIModified MIT
97Nvidianvidia-nemotron-3-ultra-550b-a55b-nvfp4Nvidia1,416+/-86,153NvidiaOpenMDW-1.1
98DeepSeek-AIDeepSeek-V3.1 TerminusDeepSeek-AI1,416+/-103,702DeepSeek-AIMIT
99Amazonamazon-nova-experimental-chat-26-01-10Amazon1,416+/-103,406AmazonProprietary
100MistralAIMistral Large 3MistralAI1,416+/-444,094MistralAIApache 2.0
101Qwen3-VL-235B-A22B-Instruct阿里巴巴1,415+/-611,512阿里巴巴Apache 2.0
102OpenAIGPT-4.1OpenAI1,414+/-450,981OpenAIProprietary
103Tencenthunyuan-hy3-previewTencent1,413+/-86,678Tencenttencent-hunyuan-community
104AnthropicClaude Opus 4Anthropic1,412+/-444,208AnthropicProprietary
105xAIGrok 3xAI1,412+/-432,905xAIProprietary
106AnthropicHaiku 4.5Anthropic1,411+/-391,153AnthropicProprietary
107GLM-4.5智谱AI1,411+/-524,310智谱AIMIT
108Google Deep MindGemini 2.5 FlashGoogle Deep Mind1,410+/-2124,544Google Deep MindProprietary
109xAIGrok 4xAI1,410+/-441,385xAIProprietary
110MistralAIMagistral-Medium-2506MistralAI1,410+/-394,036MistralAIProprietary
111Qwen3.5-27B阿里巴巴1,409+/-427,421阿里巴巴Apache 2.0
112Google Deep MindGemini 2.5 Flash-Preview-09-2025Google Deep Mind1,404+/-432,910Google Deep MindProprietary
113xAIGrok 4 Fast (fast-reasoning)xAI1,404+/-518,710xAIProprietary
114Alibabaqwen3-235b-a22b-no-thinkingAlibaba1,403+/-538,208AlibabaApache 2.0
115OpenAIGPT-5.4 nano (high)OpenAI1,403+/-438,610OpenAIProprietary
116OpenAIOpenAI o1OpenAI1,402+/-427,807OpenAIProprietary
117Qwen3-Next阿里巴巴1,402+/-522,873阿里巴巴Apache 2.0
118Meituanlongcat-flash-chatMeituan1,401+/-611,401MeituanMIT
119Alibabaqwen3-235b-a22b-thinking-2507Alibaba1,399+/-78,994AlibabaApache 2.0
120AnthropicClaude Sonnet 4 (thinking-32k)Anthropic1,399+/-435,108AnthropicProprietary
121DeepSeek-AIDeepSeek-R1DeepSeek-AI1,398+/-518,524DeepSeek-AIMIT
122StepFunAIStep 3.5 FlashStepFunAI1,397+/-440,958StepFunAIProprietary
123Tencenthunyuan-vision-1.5-thinkingTencent1,396+/-122,216TencentProprietary
124Qwen3.5-35B-A3B阿里巴巴1,396+/-429,248阿里巴巴Apache 2.0
125Qwen3-VL-235B-A22B-Instruct (thinking)阿里巴巴1,396+/-77,944阿里巴巴Apache 2.0
126DeepSeek-AIDeepSeek-V3-0324DeepSeek-AI1,396+/-445,505DeepSeek-AIMIT
127StepFunAIStep 3.5 FlashStepFunAI1,395+/-444,826StepFunAIApache 2.0
128Amazonamazon-nova-experimental-chat-12-10Amazon1,395+/-103,680AmazonProprietary
129mimo-v2-flash (non-thinking)Xiaomi1,393+/-446,705XiaomiMIT
130MiniMaxAIMiniMax M2.5MiniMaxAI1,391+/-441,271MiniMaxAIModified MIT
131OpenAIGPT-5-mini (high)OpenAI1,390+/-527,021OpenAIProprietary
132OpenAIOpenAI o4 - miniOpenAI1,390+/-445,439OpenAIProprietary
133AnthropicClaude Sonnet 4Anthropic1,389+/-440,298AnthropicProprietary
134OpenAIOpenAI o1OpenAI1,388+/-531,122OpenAIProprietary
135Qwen3-Coder-480B-A35B阿里巴巴1,388+/-525,729阿里巴巴Apache 2.0
136AnthropicClaude Sonnet 3.7 (thinking-32k)Anthropic1,387+/-438,819AnthropicProprietary
137Hunyuan-T1腾讯AI实验室1,387+/-94,704腾讯AI实验室Proprietary
138mimo-v2-flash (thinking)Xiaomi1,387+/-610,956XiaomiMIT
139Mistralmistral-medium-2505Mistral1,387+/-533,224MistralProprietary
140MiniMaxAIM2.1MiniMaxAI1,384+/-517,128MiniMaxAIMIT
141Qwen3-30B-A3B-2507阿里巴巴1,383+/-523,728阿里巴巴Apache 2.0
142OpenAIGPT-4.1 miniOpenAI1,383+/-439,329OpenAIProprietary
143Tencenthunyuan-turbos-20250416Tencent1,382+/-610,722TencentProprietary
144Google Deep MindGemini 2.5 Flash-Lite-Preview-09-2025 (no-thinking)Google Deep Mind1,380+/-347,228Google Deep MindProprietary
145trinity-large-previewArcee AI1,379+/-430,145Arcee AIApache 2.0
146GLM-4.6V智谱AI1,377+/-112,805智谱AIMIT
147Qwen3-235B-A22B阿里巴巴1,375+/-526,267阿里巴巴Apache 2.0
148Google Deep MindGemini 2.5 Flash-Lite (thinking)Google Deep Mind1,374+/-532,899Google Deep MindProprietary
149Qwen2.5-Max阿里巴巴1,374+/-432,619阿里巴巴Proprietary
150GLM-4.5-Air智谱AI1,373+/-431,077智谱AIMIT
151AnthropicClaude 3.5 SonnetAnthropic1,373+/-388,337AnthropicProprietary
152AnthropicClaude Sonnet 3.7Anthropic1,371+/-443,185AnthropicProprietary
153Qwen3-Next (thinking)阿里巴巴1,370+/-613,693阿里巴巴Apache 2.0
154trinity-large-thinkingArcee AI1,369+/-529,305Arcee AIApache 2.0
155GLM-4.7-Flash智谱AI1,368+/-611,731智谱AIMIT
156Amazonamazon-nova-experimental-chat-11-10Amazon1,367+/-425,383AmazonProprietary
157Google Deep MindGemma 3 - 27B (IT)Google Deep Mind1,366+/-447,529Google Deep MindGemma
158MiniMaxminimax-m1MiniMax1,364+/-435,208MiniMaxApache 2.0
159OpenAIOpenAI o3-mini (high)OpenAI1,363+/-518,589OpenAIProprietary
160OpenAIOpenAI o3-mini (high)OpenAI1,362+/-516,962OpenAIProprietary
161Nvidianvidia-nemotron-3-super-120b-a12bNvidia1,362+/-77,544NvidiaNVIDIA Open Model
162DeepMindGemini 2.0 Flash ExperimentalDeepMind1,360+/-443,748DeepMindProprietary
163DeepSeek-AIDeepSeek-V3DeepSeek-AI1,358+/-521,770DeepSeek-AIDeepSeek
164MistralAIMistral-Small-3.2MistralAI1,358+/-517,708MistralAIApache 2.0
165xAIgrok-3-mini-betaxAI1,357+/-522,715xAIProprietary
166intellect-3Prime Intellect1,357+/-85,331Prime IntellectMIT
167CohereAIC4AI Command A (202503)CohereAI1,354+/-356,266CohereAICC-BY-NC-4.0
168DeepMindGemini 2.0 Flash-LiteDeepMind1,353+/-424,955DeepMindProprietary
169GLM-4.5V智谱AI1,353+/-84,959智谱AIMIT
170OpenAIGPT OSS 120BOpenAI1,353+/-430,635OpenAIApache 2.0
171Google Deep MindGemini 1.5 ProGoogle Deep Mind1,351+/-355,606Google Deep MindProprietary
172Amazonamazon-nova-experimental-chat-10-20Amazon1,350+/-611,470AmazonProprietary
173Tencenthunyuan-turbos-20250226Tencent1,349+/-122,220TencentProprietary
174StepFunAIStep3StepFunAI1,348+/-76,541StepFunAIApache 2.0
175Amazonamazon-nova-experimental-chat-10-09Amazon1,348+/-112,838AmazonProprietary
176OpenAIOpenAI o3-miniOpenAI1,348+/-457,336OpenAIProprietary
177Nvidiallama-3.1-nemotron-ultra-253b-v1Nvidia1,347+/-122,549NvidiaNvidia Open Model
178Qwen3-32B阿里巴巴1,347+/-93,926阿里巴巴Apache 2.0
179mercury-2Inception AI1,346+/-113,124Inception AIProprietary
180Alibabaqwen-plus-0125Alibaba1,346+/-85,819AlibabaProprietary
181ling-flash-2.0InclusionAI1,346+/-77,006InclusionAIMIT
182MiniMaxAIMiniMax M2MiniMaxAI1,346+/-86,868MiniMaxAIApache 2.0
183OpenAIGPT-4oOpenAI1,346+/-3112,881OpenAIProprietary
184Nvidianvidia-llama-3.3-nemotron-super-49b-v1.5Nvidia1,343+/-103,345NvidiaNvidia Open
185glm-4-plus-0111Zhipu1,343+/-85,760ZhipuProprietary
186AnthropicClaude 3.5 SonnetAnthropic1,342+/-382,419AnthropicProprietary
187Google Deep MindGemma 3 - 12B (IT)Google Deep Mind1,342+/-103,829Google Deep MindGemma
188Tencenthunyuan-turbo-0110Tencent1,341+/-122,290TencentProprietary
189Nova 2 Lite亚马逊1,337+/-612,233亚马逊Proprietary
190OpenAIGPT-5-Nano (high)OpenAI1,337+/-78,266OpenAIProprietary
191OpenAIOpenAI o1-miniOpenAI1,337+/-451,981OpenAIProprietary
192QwQ-32B阿里巴巴1,336+/-425,393阿里巴巴Apache 2.0
193xAIGrok 2xAI1,336+/-463,498xAIProprietary
194Googlegemini-advanced-0514Google1,335+/-550,148GoogleProprietary
195OpenAIGPT-4oOpenAI1,335+/-445,499OpenAIProprietary
196Metallama-3.1-405b-instruct-bf16Meta1,335+/-441,375MetaLlama 3.1 Community
197StepFunstep-2-16k-exp-202412StepFun1,334+/-94,833StepFunProprietary
198Metallama-3.1-405b-instruct-fp8Meta1,333+/-459,656MetaLlama 3.1 Community
199olmo-3.1-32b-instructAi21,330+/-612,220Ai2Apache 2.0
200yi-lightning01 AI1,328+/-527,33201 AIProprietary
201Nvidiallama-3.3-nemotron-49b-super-v1Nvidia1,328+/-122,218NvidiaNvidia
202molmo-2-8bAi21,327+/-21804Ai2Apache 2.0
203Qwen3-30B-A3B阿里巴巴1,327+/-526,486阿里巴巴Apache 2.0
204Llama 4 Maverick InstructFacebook AI研究实验室1,327+/-439,982Facebook AI研究实验室Llama 4
205Tencenthunyuan-large-2025-02-10Tencent1,326+/-103,738TencentProprietary
206OpenAIgpt-4-turbo-2024-04-09OpenAI1,324+/-498,114OpenAIProprietary
207AnthropicClaude 3.5 HaikuAnthropic1,324+/-369,979AnthropicProprietary
208Google Deep MindGemini 1.5 ProGoogle Deep Mind1,323+/-479,138Google Deep MindProprietary
209DeepSeekdeepseek-v2.5-1210DeepSeek1,323+/-86,795DeepSeekDeepSeek
210Llama 4 Scout InstructFacebook AI研究实验室1,323+/-530,286Facebook AI研究实验室Llama
211OpenAIGPT-4.1 nanoOpenAI1,322+/-86,103OpenAIProprietary
212AnthropicClaude3-OpusAnthropic1,321+/-3194,909AnthropicProprietary
213ring-flash-2.0InclusionAI1,321+/-77,148InclusionAIMIT
214StepFunstep-1o-turbo-202506StepFun1,320+/-79,041StepFunProprietary
215glm-4-plusZhipu AI1,319+/-526,126Zhipu AIProprietary
216Google Deep MindGemma-3n-E4BGoogle Deep Mind1,318+/-522,594Google Deep MindGemma
217Llama3.3-70B-InstructFacebook AI研究实验室1,318+/-354,734Facebook AI研究实验室Llama-3.3
218Alibabaqwen-max-0919Alibaba1,318+/-616,478AlibabaQwen
219OpenAIGPT-4o miniOpenAI1,318+/-468,709OpenAIProprietary
220OpenAIGPT OSS 20BOpenAI1,318+/-610,627OpenAIApache 2.0
221Nvidianvidia-nemotron-3-nano-30b-a3b-bf16Nvidia1,316+/-615,506NvidiaNVIDIA Open Model
222Alibabaqwen2.5-plus-1127Alibaba1,315+/-610,187AlibabaProprietary
223athene-v2-chatNexusFlow1,314+/-524,739NexusFlowNexusFlow
224Mistralmistral-large-2407Mistral1,314+/-445,459MistralMistral Research
225OpenAIGPT-4OpenAI1,313+/-493,439OpenAIProprietary
226OpenAIGPT-4OpenAI1,312+/-4100,105OpenAIProprietary
227Tencenthunyuan-standard-2025-02-10Tencent1,311+/-103,904TencentProprietary
228Googlegemini-1.5-flash-002Google1,309+/-434,902GoogleProprietary
229xAIgrok-2-mini-2024-08-13xAI1,308+/-452,567xAIProprietary
230DeepSeek-AIDeepSeek V2.5DeepSeek-AI1,307+/-524,572DeepSeek-AIDeepSeek
231granite-4.1-8bIBM1,307+/-104,065IBMApache 2.0
232athene-70b-0725NexusFlow1,306+/-619,621NexusFlowCC-BY-NC-4.0
233mercuryInception AI1,306+/-141,953Inception AIProprietary
234olmo-3-32b-thinkAi21,305+/-85,946Ai2Apache 2.0
235Mistralmistral-large-2411Mistral1,305+/-428,073MistralMRL
236MistralAIMagistral-Medium-2506MistralAI1,304+/-611,638MistralAIProprietary
237MistralAIMistral-Small-3.1-24B-Instruct-2503MistralAI1,303+/-533,216MistralAIApache 2.0
238Google Deep MindGemma 3 - 4B (IT)Google Deep Mind1,303+/-94,171Google Deep MindGemma
239Qwen2.5-VL-72B-Instruct阿里巴巴1,303+/-439,406阿里巴巴Qwen
240Llama3.1-70B-InstructFacebook AI研究实验室1,299+/-87,140Facebook AI研究实验室Llama 3.1
241Tencenthunyuan-large-visionTencent1,294+/-95,372TencentProprietary
242Llama3.1-70B-InstructFacebook AI研究实验室1,293+/-455,240Facebook AI研究实验室Llama 3.1 Community
243Amazonamazon-nova-pro-v1.0Amazon1,290+/-524,745AmazonProprietary
244jamba-1.5-largeAI21 Labs1,289+/-78,662AI21 LabsJamba Open
245Googlegemma-2-27b-itGoogle1,289+/-375,754GoogleGemma license
246reka-core-20240904Reka AI1,288+/-77,312Reka AIProprietary
247ibm-granite-h-smallIBM1,287+/-85,684IBMApache 2.0
248OpenAIGPT-4OpenAI1,287+/-554,173OpenAIProprietary
249Googlegemini-1.5-flash-001Google1,286+/-462,833GoogleProprietary
250llama-3.1-tulu-3-70bAi21,286+/-102,846Ai2Llama 3.1
251Nvidiallama-3.1-nemotron-51b-instructNvidia1,286+/-103,749NvidiaLlama 3.1
252olmo-3.1-32b-thinkAi21,285+/-78,501Ai2Apache 2.0
253AnthropicClaude3-SonnetAnthropic1,280+/-4109,284AnthropicProprietary
254gemma-2-9b-it-simpoPrinceton1,280+/-710,072PrincetonMIT
255Nvidianemotron-4-340b-instructNvidia1,276+/-519,659NvidiaNVIDIA Open Model
256Llama3-70B-InstructFacebook AI研究实验室1,276+/-4156,876Facebook AI研究实验室Llama 3 Community
257Coherecommand-r-plus-08-2024Cohere1,276+/-79,866CohereCC-BY-NC-4.0
258OpenAIGPT-4OpenAI1,275+/-488,723OpenAIProprietary
259MistralAIMistral Small 24B Instruct 2501MistralAI1,274+/-614,681MistralAIApache 2.0
260GLM4智谱AI1,273+/-79,788智谱AIProprietary
261reka-flash-20240904Reka AI1,272+/-77,536Reka AIProprietary
262Qwen2.5-Coder-32B-Instruct阿里巴巴1,270+/-85,432阿里巴巴Apache 2.0
263CohereAIC4AI Aya Vision 32BCohereAI1,267+/-527,124CohereAICC-BY-NC-4.0
264Googlegemma-2-9b-itGoogle1,266+/-454,611GoogleGemma license
265DeepSeekdeepseek-coder-v2DeepSeek1,264+/-615,147DeepSeekDeepSeek License
266Qwen2-72B-Instruct阿里巴巴1,261+/-537,325阿里巴巴Qianwen LICENSE
267CohereAIC4AI Command R+CohereAI1,261+/-477,554CohereAICC-BY-NC-4.0
268AnthropicClaude3-HaikuAnthropic1,261+/-4117,701AnthropicProprietary
269Amazonamazon-nova-lite-v1.0Amazon1,260+/-519,372AmazonProprietary
270Googlegemini-1.5-flash-8b-001Google1,259+/-435,558GoogleProprietary
271Microsoft AzurePhi 4 - 14BMicrosoft Azure1,256+/-524,126Microsoft AzureMIT
272olmo-2-0325-32b-instructAi21,251+/-113,334Ai2Apache-2.0
273Coherecommand-r-08-2024Cohere1,250+/-710,140CohereCC-BY-NC-4.0
274Mistralmistral-large-2402Mistral1,242+/-562,436MistralProprietary
275Amazonamazon-nova-micro-v1.0Amazon1,241+/-519,364AmazonProprietary
276jamba-1.5-miniAI21 Labs1,239+/-78,858AI21 LabsJamba Open
277Mistralministral-8b-2410Mistral1,237+/-94,781MistralMRL
278Googlegemini-pro-dev-apiGoogle1,235+/-718,354GoogleProprietary
279Qwen1.5-110B-Chat阿里巴巴1,233+/-626,195阿里巴巴Qianwen LICENSE
280Tencenthunyuan-standard-256kTencent1,233+/-122,728TencentProprietary
281reka-flash-21b-20240226-onlineReka AI1,233+/-715,450Reka AIProprietary
282Qwen1.5-72B-Chat阿里巴巴1,233+/-539,302阿里巴巴Qianwen LICENSE
283MistralAIMixtral-8x22B-Instruct-v0.1MistralAI1,229+/-551,416MistralAIApache 2.0
284Coherecommand-rCohere1,226+/-554,036CohereCC-BY-NC-4.0
285reka-flash-21b-20240226Reka AI1,226+/-624,806Reka AIProprietary
286OpenAIgpt-3.5-turbo-0125OpenAI1,224+/-566,207OpenAIProprietary
287Llama3-8B-InstructFacebook AI研究实验室1,223+/-4104,642Facebook AI研究实验室Llama 3 Community
288CohereAIC4AI Aya Vision 8BCohereAI1,223+/-79,818CohereAICC-BY-NC-4.0
289DeepMindGemini-proDeepMind1,222+/-126,390DeepMindProprietary
290Mistralmistral-mediumMistral1,222+/-634,550MistralProprietary
291llama-3.1-tulu-3-8bAi21,220+/-112,896Ai2Llama 3.1
292Yi-1.5-34B零一万物1,212+/-524,146零一万物Apache-2.0
293zephyr-orpo-141b-A35b-v0.1HuggingFace1,212+/-114,652HuggingFaceApache 2.0
294Llama3.1-8B-InstructFacebook AI研究实验室1,211+/-449,605Facebook AI研究实验室Llama 3.1 Community
295Llama3.1-8B-InstructFacebook AI研究实验室1,208+/-113,090Facebook AI研究实验室Apache 2.0
296Alibabaqwen1.5-32b-chatAlibaba1,203+/-621,741AlibabaQianwen LICENSE
297OpenAIgpt-3.5-turbo-1106OpenAI1,202+/-916,619OpenAIProprietary
298Googlegemma-2-2b-itGoogle1,200+/-446,616GoogleGemma license
299Microsoft AzurePhi-3-medium 14B-previewMicrosoft Azure1,197+/-525,055Microsoft AzureMIT
300Mistralmixtral-8x7b-instruct-v0.1Mistral1,196+/-473,503MistralApache 2.0
301DBRX Instructdatabricks1,194+/-632,191databricksDBRX LICENSE
302InternLM2-Base-20B上海人工智能实验室1,191+/-79,901上海人工智能实验室Other
303Qwen1.5-14B-Chat阿里巴巴1,190+/-717,839阿里巴巴Qianwen LICENSE
304WizardLM-70B-V1.0WizardLM Team1,184+/-98,214WizardLM TeamLlama 2 Community
305DeepSeek-AIDeepSeek LLM 67B ChatDeepSeek-AI1,184+/-114,932DeepSeek-AIDeepSeek License
306Yi-34B零一万物1,183+/-715,483零一万物Yi License
307granite-3.0-8b-instructIBM1,182+/-96,638IBMApache 2.0
308openchat-3.5OpenChat1,182+/-107,968OpenChatApache-2.0
309openchat-3.5-0106OpenChat1,182+/-812,637OpenChatApache-2.0
310Google ResearchGemma 1.1-7B-ITGoogle Research1,181+/-623,893Google ResearchGemma license
311snowflake-arctic-instructSnowflake1,179+/-632,832SnowflakeApache 2.0
312granite-3.1-2b-instructIBM1,178+/-113,188IBMApache 2.0
313tulu-2-dpo-70bAllenAI/UW1,177+/-106,535AllenAI/UWAI2 ImpACT Low-risk
314openhermes-2.5-mistral-7bNousResearch1,175+/-105,006NousResearchApache-2.0
315Vicuna 33BLM-SYS1,172+/-622,479LM-SYSNon-commercial
316starling-lm-7b-betaNexusflow1,171+/-716,056NexusflowApache-2.0
317Microsoft AzurePhi-3-small 7BMicrosoft Azure1,170+/-617,766Microsoft AzureMIT
318Metallama-2-70b-chatMeta1,170+/-638,492MetaLlama 2 Community
319starling-lm-7b-alphaUC Berkeley1,167+/-810,224UC BerkeleyCC-BY-NC-4.0
320Metallama-3.2-3b-instructMeta1,166+/-87,936MetaLlama 3.2
321nous-hermes-2-mixtral-8x7b-dpoNousResearch1,164+/-123,777NousResearchApache-2.0
322Qwen3-VL-2B阿里巴巴1,156+/-86,837阿里巴巴Apache 2.0
323QwQ-32B-Preview阿里巴巴1,155+/-113,231阿里巴巴Apache 2.0
324Nvidiallama2-70b-steerlm-chatNvidia1,154+/-133,585NvidiaLlama 2 Community
325solar-10.7b-instruct-v1.0Upstage AI1,151+/-134,155Upstage AICC-BY-NC-4.0
326dolphin-2.2.1-mistral-7bCognitive Computations1,151+/-151,679Cognitive ComputationsApache-2.0
327MPT-30B-ChatMosaicML1,150+/-122,572MosaicMLCC-BY-NC-SA-4.0
328MistralAIMistral-7B-Instruct-v0.2MistralAI1,149+/-719,402MistralAIApache-2.0
329Microsoftwizardlm-13bMicrosoft1,148+/-97,044MicrosoftLlama 2 Community
330falcon-180b-chatTII1,147+/-171,295TIIFalcon-180B TII License
331Qwen1.5-7B-Chat阿里巴巴1,143+/-104,737阿里巴巴Qianwen LICENSE
332Microsoft AzurePhi-3-mini 3.8BMicrosoft Azure1,142+/-612,297Microsoft AzureMIT
333Baichuan2-13B-Chat百川智能1,141+/-719,174百川智能Llama 2 Community
334Vicuna 13BLM-SYS1,140+/-719,367LM-SYSLlama 2 Community
335Qwen-14B-Chat阿里巴巴1,138+/-114,964阿里巴巴Qianwen LICENSE
336Google ResearchPaLM 2Google Research1,137+/-98,554Google ResearchProprietary
337Google ResearchGemma 7B - ItGoogle Research1,137+/-98,925Google ResearchGemma license
338CodeLLaMA-34BFacebook AI研究实验室1,136+/-97,366Facebook AI研究实验室Llama 2 Community
339zephyr-7b-betaHuggingFace1,130+/-911,118HuggingFaceMIT
340Microsoft AzurePhi-3-mini 3.8BMicrosoft Azure1,129+/-720,685Microsoft AzureMIT
341Microsoft AzurePhi-3-mini 3.8BMicrosoft Azure1,127+/-620,118Microsoft AzureMIT
342guanaco-33bUW1,126+/-122,921UWNon-commercial
343zephyr-7b-alphaHuggingFace1,126+/-161,785HuggingFaceMIT
344stripedhyena-nous-7bTogether AI1,120+/-115,182Together AIApache 2.0
345CodeLlama-70B-InstructFacebook AI研究实验室1,118+/-181,143Facebook AI研究实验室Llama 2 Community
346Google ResearchGemma 1.1-2B-ITGoogle Research1,115+/-810,854Google ResearchGemma license
347Vicuna 7BLM-SYS1,114+/-96,923LM-SYSLlama 2 Community
348smollm2-1.7b-instructHuggingFace1,114+/-142,199HuggingFaceApache 2.0
349Metallama-3.2-1b-instructMeta1,110+/-88,045MetaLlama 3.2
350MistralAIMistral 7B InstructMistralAI1,109+/-98,977MistralAIApache 2.0
351Baichuan2-7B-Chat百川智能1,107+/-714,148百川智能Llama 2 Community
352Google ResearchGemma 2B - ItGoogle Research1,092+/-114,780Google ResearchGemma license
353Qwen1.5-4B-Chat阿里巴巴1,090+/-97,597阿里巴巴Qianwen LICENSE
354olmo-7b-instructAi21,073+/-116,328Ai2Apache-2.0
355Koala达摩院1,070+/-106,965达摩院Non-commercial
356alpaca-13bStanford1,068+/-115,745StanfordNon-commercial
357GPT4All 13BNomic AI1,066+/-151,743Nomic AINon-commercial
358MPT-7B-ChatMosaicML1,062+/-123,924MosaicMLCC-BY-NC-SA-4.0
359ChatGLM3-6B智谱AI1,055+/-124,658智谱AIApache-2.0
360RWKV-4-Raven-14BRWKV1,041+/-114,845RWKVApache 2.0
361ChatGLM2-6B智谱AI1,024+/-142,658智谱AIApache-2.0
362oasst-pythia-12bOpenAssistant1,022+/-116,310OpenAssistantApache 2.0
363ChatGLM-6B智谱AI995+/-134,914智谱AINon-commercial
364fastchat-t5-3bLMSYS991+/-124,203LMSYSApache 2.0
365dolly-v2-12bDatabricks980+/-143,412DatabricksMIT
366LLaMA 13BFacebook AI研究实验室973+/-162,391Facebook AI研究实验室Non-commercial
367stablelm-tuned-alpha-7bStability AI952+/-133,287Stability AICC-BY-NC-SA-4.0

Data is for reference only. Official sources are authoritative. Click model names to view DataLearner model profiles.

FAQ

01

What is Text Generation Arena (LMArena)?

Text Generation Arena, formerly LMSYS Chatbot Arena, is one of the most widely followed anonymous LLM evaluation platforms. Users compare answers from two hidden models and vote for the better response; Elo-style scoring aggregates those votes into a dynamic leaderboard.

02

How is the Arena Elo score calculated?

Arena Elo is adapted from chess rating systems. After each head-to-head comparison, the preferred model gains rating points and the other model loses points, with the size of the change depending on the rating gap. The 95% confidence interval reflects how much comparison data supports the estimate.

03

Why do some models have both Thinking and regular versions?

Some models offer an extended-thinking mode that spends more inference time reasoning before producing the final answer. This can improve scores on reasoning, math, and coding tasks, but usually increases latency and cost, so Arena tracks these variants separately.

04

How should I choose an LLM from this leaderboard?

Consider overall Elo, cost, language coverage, open-source availability, and latency. The top-ranked model is not always the best fit for every workflow.