DataLearner AI

A knowledge platform focused on LLM benchmarking, datasets, and practical instruction with continuously updated capability maps.

© 2026 DataLearner AI. DataLearner curates industry data and case studies so researchers, enterprises, and developers can rely on trustworthy intelligence.


Text Generation Arena Leaderboard

This is the latest AI text generation leaderboard, based on anonymous user voting on LMArena. It covers Elo scores, confidence intervals, and vote counts for leading language models.

Top Model

Opus 4.7 (thinking)

Top Score

1,503

Model Count

357

Data version

2026-05-07

Data source: LMArena

About This Leaderboard

This leaderboard ranks the strongest AI models for text generation. Data comes from LMArena (formerly LMSYS Chatbot Arena), a large crowdsourced AI evaluation platform. Users chat with two anonymous models side by side and vote for the better response, so rankings reflect real user preferences rather than lab benchmarks.

Methodology Overview

Blind testing: Users chat with two anonymous models and vote based on response quality, eliminating brand bias.

Elo scoring: Each model's strength score is estimated from battle outcomes with the Bradley-Terry model, which is closely related to chess Elo ratings. A higher score means users prefer that model more often.

Broad scenario coverage: Testing spans coding, creative writing, math reasoning, Q&A, role-playing, and more.
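As a rough illustration of the Elo scoring step, the sketch below fits Bradley-Terry strengths to a list of (winner, loser) votes and maps them onto an Elo-like scale. It is a minimal sketch, not LMArena's actual pipeline: the function name `fit_bradley_terry`, the 400-point scale, the 1000-point base, and the iterative update used here are all assumptions, and ties and confidence intervals are left out.

```python
import math
from collections import defaultdict


def fit_bradley_terry(battles, iters=200, scale=400.0, base=1000.0):
    """Fit Bradley-Terry strengths from (winner, loser) pairs.

    Hypothetical helper for illustration. Assumes every model has at
    least one win and that the comparison graph is connected.
    """
    wins = defaultdict(float)        # total wins per model
    pair_games = defaultdict(float)  # number of battles per unordered pair
    models = set()
    for winner, loser in battles:
        wins[winner] += 1
        pair_games[frozenset((winner, loser))] += 1
        models.update((winner, loser))

    strength = {m: 1.0 for m in models}
    for _ in range(iters):
        updated = {}
        for m in models:
            # Sum of n_ij / (p_i + p_j) over every opponent j of model m.
            denom = 0.0
            for pair, n in pair_games.items():
                if m in pair:
                    (other,) = pair - {m}
                    denom += n / (strength[m] + strength[other])
            updated[m] = wins[m] / denom
        # Rescale so the geometric mean of the strengths stays at 1.
        log_mean = sum(math.log(v) for v in updated.values()) / len(updated)
        strength = {m: v / math.exp(log_mean) for m, v in updated.items()}

    # Map strengths to an Elo-like scale (assumed: 400 points per factor of 10).
    return {m: base + scale * math.log10(s) for m, s in strength.items()}
```

For example, calling `fit_bradley_terry([("A", "B")] * 3 + [("B", "A")])` on these toy votes places model A about 191 points above model B, which corresponds to the 75% win rate in the input.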

DataLearner provides in-depth analysis on top of the raw data, linking leaderboard models to the DataLearner model database so you can quickly access model details, API pricing, benchmark scores, and more.


Ranking Table

Rank | Model | Score | 95% CI | Votes | Organization | License
1 | Opus 4.7 (thinking) | 1,503 | +/-6 | 8,945 | Anthropic | Proprietary
2 | Claude Opus 4.6 (thinking) | 1,502 | +/-5 | 23,616 | Anthropic | Proprietary
3 | Claude Opus 4.6 | 1,498 | +/-5 | 25,089 | Anthropic | Proprietary
4 | Gemini 3.1 Pro Preview | 1,492 | +/-4 | 29,468 | Google Deep Mind | Proprietary
5 | Opus 4.7 | 1,491 | +/-6 | 9,614 | Anthropic | Proprietary
6 | Muse Spark | 1,490 | +/-6 | 10,491 | Facebook AI Research | Proprietary
7 | Gemini 3.0 Pro (Preview 11-2025) | 1,486 | +/-4 | 41,381 | Google Deep Mind | Proprietary
8 | gpt-5.5-high | 1,484 | +/-7 | 6,488 | OpenAI | Proprietary
9 | grok-4.20-beta1 | 1,480 | +/-5 | 18,791 | xAI | Proprietary
10 | gpt-5.2-chat-latest-20260210 | 1,477 | +/-5 | 23,717 | OpenAI | Proprietary
11 | gpt-5.4-high | 1,477 | +/-5 | 17,146 | OpenAI | Proprietary
12 | grok-4.20-beta-0309-reasoning | 1,477 | +/-5 | 17,538 | xAI | Proprietary
13 | GPT-5.5 | 1,475 | +/-7 | 6,653 | OpenAI | Proprietary
14 | ernie-5.1 | 1,474 | +/-8 | 5,733 | Baidu | Proprietary
15 | grok-4.20-multi-agent-beta-0309 | 1,474 | +/-5 | 17,728 | xAI | Proprietary
16 | Gemini 3.0 Flash | 1,474 | +/-4 | 30,784 | Google Deep Mind | Proprietary
17 | Claude Opus 4 (thinking-32k) | 1,473 | +/-4 | 37,168 | Anthropic | Proprietary
18 | gpt-5.5-instant | 1,473 | +/-11 | 2,833 | OpenAI | Proprietary
19 | GLM 5.1 | 1,471 | +/-6 | 11,349 | Zhipu AI | MIT
20 | Claude Opus 4 | 1,468 | +/-3 | 54,886 | Anthropic | Proprietary
21 | GPT-5.4 | 1,468 | +/-5 | 17,925 | OpenAI | Proprietary
22 | Grok 4.1 Thinking | 1,467 | +/-3 | 55,257 | xAI | Proprietary
23 | Claude Sonnet 4.6 | 1,466 | +/-5 | 17,127 | Anthropic | Proprietary
24 | mimo-v2.5-pro | 1,464 | +/-7 | 6,238 | Xiaomi | MIT
25 | qwen3.5-max-preview | 1,464 | +/-5 | 14,558 | Alibaba | Proprietary
26 | Gemini 3.0 Flash (minimal) | 1,463 | +/-4 | 41,346 | Google Deep Mind | Proprietary
27 | DeepSeek-V4-Pro | 1,463 | +/-9 | 4,160 | DeepSeek-AI | MIT
28 | Kimi K2.6 | 1,462 | +/-7 | 7,108 | Moonshot AI | Modified MIT
29 | deepseek-v4-pro-thinking | 1,462 | +/-9 | 3,808 | DeepSeek | MIT
30 | Grok 4.1 | 1,459 | +/-3 | 59,206 | xAI | Proprietary
31 | dola-seed-2.0-pro | 1,459 | +/-5 | 26,587 | Bytedance | Proprietary
32 | Qwen3.6-Max-Preview | 1,457 | +/-9 | 3,965 | Alibaba | Proprietary
33 | GLM-5 | 1,457 | +/-5 | 20,292 | Zhipu AI | MIT
34 | gpt-5.4-mini-high | 1,456 | +/-5 | 14,952 | OpenAI | Proprietary
35 | Grok 4.3 Beta | 1,455 | +/-8 | 5,234 | xAI | Proprietary
36 | GPT-5.1 Pro (high) | 1,455 | +/-4 | 40,891 | OpenAI | Proprietary
37 | Claude Sonnet 4.5 (thinking-32k) | 1,454 | +/-3 | 67,180 | Anthropic | Proprietary
38 | Claude Sonnet 4.5 | 1,453 | +/-3 | 65,214 | Anthropic | Proprietary
39 | Gemma 4 31B | 1,451 | +/-8 | 5,827 | DeepMind | Apache 2.0
40 | ERNIE 5.0 | 1,450 | +/-4 | 28,724 | Baidu | Proprietary
41 | Kimi K2 Thinking | 1,449 | +/-4 | 27,282 | Moonshot AI | Modified MIT
42 | ERNIE 5.0 | 1,449 | +/-7 | 9,764 | Baidu | Proprietary
43 | Opus 4.1 (thinking-16k) | 1,449 | +/-3 | 49,850 | Anthropic | Proprietary
44 | gpt-5.3-chat-latest | 1,449 | +/-5 | 22,474 | OpenAI | Proprietary
45 | Gemini 2.5 Pro Experimental 03-25 | 1,448 | +/-3 | 114,865 | Google Deep Mind | Proprietary
46 | Qwen 3.6 Plus Preview | 1,448 | +/-6 | 8,683 | Alibaba | Proprietary
47 | Opus 4.1 | 1,447 | +/-3 | 77,425 | Anthropic | Proprietary
48 | mimo-v2-pro | 1,447 | +/-5 | 15,257 | Xiaomi | Proprietary
49 | Qwen3.5-397B-A17B | 1,446 | +/-5 | 22,471 | Alibaba | Apache 2.0
50 | GPT-4.5 | 1,444 | +/-6 | 14,547 | OpenAI | Proprietary
51 | chatgpt-4o-latest-20250326 | 1,443 | +/-3 | 82,527 | OpenAI | Proprietary
52 | GLM-4.7 | 1,443 | +/-6 | 12,142 | Zhipu AI | MIT
53 | deepseek-v4-flash-thinking | 1,440 | +/-9 | 3,600 | DeepSeek | MIT
54 | GPT-5.2 Pro (high) | 1,440 | +/-4 | 38,067 | OpenAI | Proprietary
55 | GPT-5.1 Instant | 1,439 | +/-4 | 43,533 | OpenAI | Proprietary
56 | gemini-3.1-flash-lite-preview | 1,438 | +/-5 | 23,715 | Google | Proprietary
57 | Gemma 4 26B A4B | 1,438 | +/-8 | 5,782 | DeepMind | Apache 2.0
58 | GPT-5.2 | 1,437 | +/-4 | 35,182 | OpenAI | Proprietary
59 | Qwen3 Max (Preview) | 1,435 | +/-5 | 27,743 | Alibaba | Proprietary
60 | longcat-flash-chat-2602-exp | 1,434 | +/-6 | 13,311 | Meituan | Proprietary
61 | GPT-5-Pro (high) | 1,434 | +/-5 | 31,963 | OpenAI | Proprietary
62 | DeepSeek-V4-Flash | 1,433 | +/-9 | 3,506 | DeepSeek-AI | MIT
63 | kimi-k2.5-instant | 1,432 | +/-7 | 8,207 | Moonshot | Modified MIT
64 | grok-4-1-fast-reasoning | 1,432 | +/-3 | 50,028 | xAI | Proprietary
65 | OpenAI o3 | 1,431 | +/-4 | 59,783 | OpenAI | Proprietary
66 | Kimi K2 Thinking (thinking-turbo) | 1,430 | +/-3 | 52,935 | Moonshot AI | Modified MIT
67 | amazon-nova-experimental-chat-26-02-10 | 1,428 | +/-10 | 3,424 | Amazon | Proprietary
68 | GPT-5 | 1,426 | +/-4 | 31,617 | OpenAI | Proprietary
69 | GLM-4.6 | 1,426 | +/-4 | 35,694 | Zhipu AI | MIT
70 | DeepSeek V3.2-Exp (thinking) | 1,425 | +/-7 | 9,076 | DeepSeek-AI | MIT
71 | DeepSeek V3.2 | 1,424 | +/-4 | 44,820 | DeepSeek-AI | MIT
72 | qwen3-max-2025-09-23 | 1,424 | +/-6 | 9,179 | Alibaba | Proprietary
73 | Claude Opus 4 (thinking-16k) | 1,424 | +/-4 | 36,937 | Anthropic | Proprietary
74 | DeepSeek V3.2-Exp | 1,423 | +/-6 | 11,943 | DeepSeek-AI | MIT
75 | mimo-v2.5 | 1,423 | +/-7 | 6,300 | Xiaomi | MIT
76 | Qwen3-235B-A22B-2507 | 1,423 | +/-3 | 88,518 | Alibaba | Apache 2.0
77 | DeepSeek V3.2 (thinking) | 1,422 | +/-4 | 39,071 | DeepSeek-AI | MIT
78 | DeepSeek-R1-0528 | 1,422 | +/-6 | 18,469 | DeepSeek-AI | MIT
79 | Grok 4 Fast | 1,421 | +/-8 | 6,823 | xAI | Proprietary
80 | ERNIE 5.0 | 1,419 | +/-9 | 4,715 | Baidu | Proprietary
81 | Qwen3.5-122B-A10B | 1,418 | +/-5 | 19,379 | Alibaba | Apache 2.0
82 | hunyuan-hy3-preview | 1,418 | +/-8 | 4,582 | Tencent | tencent-hunyuan-community
83 | Kimi K2 0905 | 1,418 | +/-6 | 11,798 | Moonshot AI | Modified MIT
84 | DeepSeek-V3.1 | 1,418 | +/-6 | 14,985 | DeepSeek-AI | MIT
85 | Kimi K2 | 1,417 | +/-5 | 27,644 | Moonshot AI | Modified MIT
86 | deepseek-v3.1-terminus-thinking | 1,417 | +/-10 | 3,474 | DeepSeek | MIT
87 | DeepSeek-V3.1 (thinking) | 1,417 | +/-7 | 11,754 | DeepSeek-AI | MIT
88 | DeepSeek-V3.1 Terminus | 1,416 | +/-10 | 3,713 | DeepSeek-AI | MIT
89 | Qwen3-VL-235B-A22B-Instruct | 1,415 | +/-6 | 11,529 | Alibaba | Apache 2.0
90 | amazon-nova-experimental-chat-26-01-10 | 1,415 | +/-10 | 3,418 | Amazon | Proprietary
91 | Mistral Large 3 | 1,415 | +/-4 | 41,365 | MistralAI | Apache 2.0
92 | GPT-4.1 | 1,413 | +/-4 | 51,035 | OpenAI | Proprietary
93 | Claude Opus 4 | 1,412 | +/-4 | 44,244 | Anthropic | Proprietary
94 | Grok 3 | 1,412 | +/-4 | 32,916 | xAI | Proprietary
95 | GLM-4.5 | 1,411 | +/-5 | 24,336 | Zhipu AI | MIT
96 | Gemini 2.5 Flash | 1,411 | +/-3 | 114,591 | Google Deep Mind | Proprietary
97 | Grok 4 | 1,410 | +/-4 | 41,416 | xAI | Proprietary
98 | Magistral-Medium-2506 | 1,409 | +/-3 | 84,463 | MistralAI | Proprietary
99 | Haiku 4.5 | 1,409 | +/-3 | 67,007 | Anthropic | Proprietary
100 | MiniMax-M2.7 | 1,407 | +/-6 | 13,525 | MiniMaxAI | Modified MIT
101 | Qwen3.5-27B | 1,406 | +/-5 | 18,942 | Alibaba | Apache 2.0
102 | gpt-5.4-nano-high | 1,406 | +/-5 | 14,363 | OpenAI | Proprietary
103 | Gemini 2.5 Flash-Preview-09-2025 | 1,405 | +/-4 | 32,938 | Google Deep Mind | Proprietary
104 | grok-4-fast-reasoning | 1,404 | +/-5 | 18,737 | xAI | Proprietary
105 | qwen3-235b-a22b-no-thinking | 1,403 | +/-5 | 38,241 | Alibaba | Apache 2.0
106 | Qwen3-Next | 1,402 | +/-5 | 22,883 | Alibaba | Apache 2.0
107 | o1-2024-12-17 | 1,402 | +/-4 | 27,807 | OpenAI | Proprietary
108 | longcat-flash-chat | 1,401 | +/-6 | 11,409 | Meituan | MIT
109 | qwen3-235b-a22b-thinking-2507 | 1,399 | +/-7 | 9,004 | Alibaba | Apache 2.0
110 | Claude Sonnet 4 (thinking-32k) | 1,399 | +/-4 | 35,132 | Anthropic | Proprietary
111 | Step 3.5 Flash | 1,398 | +/-5 | 19,649 | StepFunAI | Proprietary
112 | DeepSeek-R1 | 1,398 | +/-5 | 18,524 | DeepSeek-AI | MIT
113 | Qwen3.5-35B-A3B | 1,397 | +/-5 | 19,774 | Alibaba | Apache 2.0
114 | hunyuan-vision-1.5-thinking | 1,396 | +/-12 | 2,221 | Tencent | Proprietary
115 | Qwen3-VL-235B-A22B-Instruct (thinking) | 1,396 | +/-7 | 7,944 | Alibaba | Apache 2.0
116 | amazon-nova-experimental-chat-12-10 | 1,395 | +/-10 | 3,690 | Amazon | Proprietary
117 | DeepSeek-V3-0324 | 1,395 | +/-4 | 45,533 | DeepSeek-AI | MIT
118 | MiniMax M2.5 | 1,395 | +/-4 | 24,885 | MiniMaxAI | Modified MIT
119 | Step 3.5 Flash | 1,393 | +/-4 | 25,112 | StepFunAI | Apache 2.0
120 | mimo-v2-flash (non-thinking) | 1,393 | +/-4 | 37,247 | Xiaomi | MIT
121 | mai-1-preview | 1,393 | +/-5 | 17,899 | Microsoft AI | Proprietary
122 | gpt-5-mini-high | 1,390 | +/-5 | 27,053 | OpenAI | Proprietary
123 | OpenAI o4 - mini | 1,390 | +/-4 | 45,463 | OpenAI | Proprietary
124 | Claude Sonnet 4 | 1,389 | +/-4 | 40,351 | Anthropic | Proprietary
125 | OpenAI o1 | 1,388 | +/-5 | 31,122 | OpenAI | Proprietary
126 | mimo-v2-flash (thinking) | 1,388 | +/-6 | 10,982 | Xiaomi | MIT
127 | Hunyuan-T1 | 1,387 | +/-9 | 4,710 | Tencent AI Lab | Proprietary
128 | Qwen3-Coder-480B-A35B | 1,387 | +/-5 | 25,757 | Alibaba | Apache 2.0
129 | Claude Sonnet 3.7 (thinking-32k) | 1,387 | +/-4 | 38,841 | Anthropic | Proprietary
130 | mistral-medium-2505 | 1,386 | +/-5 | 33,244 | Mistral | Proprietary
131 | M2.1 | 1,385 | +/-5 | 17,165 | MiniMaxAI | MIT
132 | Qwen3-30B-A3B-2507 | 1,383 | +/-5 | 23,766 | Alibaba | Apache 2.0
133 | GPT-4.1 mini | 1,382 | +/-4 | 39,353 | OpenAI | Proprietary
134 | hunyuan-turbos-20250416 | 1,382 | +/-6 | 10,723 | Tencent | Proprietary
135 | trinity-large-thinking | 1,380 | +/-6 | 12,239 | Arcee AI | Apache 2.0
136 | Gemini 2.5 Flash-Lite-Preview-09-2025 (no-thinking) | 1,380 | +/-3 | 47,285 | Google Deep Mind | Proprietary
137 | GLM-4.6V | 1,378 | +/-11 | 2,810 | Zhipu AI | MIT
138 | trinity-large-preview | 1,375 | +/-5 | 20,978 | Arcee AI | Apache 2.0
139 | Qwen3-235B-A22B | 1,375 | +/-5 | 26,284 | Alibaba | Apache 2.0
140 | Gemini 2.5 Flash-Lite (thinking) | 1,374 | +/-5 | 32,947 | Google Deep Mind | Proprietary
141 | Qwen2.5-Max | 1,374 | +/-4 | 32,625 | Alibaba | Proprietary
142 | GLM-4.5-Air | 1,373 | +/-4 | 31,119 | Zhipu AI | MIT
143 | Claude 3.5 Sonnet | 1,372 | +/-3 | 88,359 | Anthropic | Proprietary
144 | Claude Sonnet 3.7 | 1,371 | +/-4 | 43,206 | Anthropic | Proprietary
145 | Qwen3-Next (thinking) | 1,369 | +/-6 | 13,707 | Alibaba | Apache 2.0
146 | GLM-4.7-Flash | 1,368 | +/-6 | 11,763 | Zhipu AI | MIT
147 | amazon-nova-experimental-chat-11-10 | 1,367 | +/-4 | 25,445 | Amazon | Proprietary
148 | Gemma 3 - 27B (IT) | 1,366 | +/-4 | 47,569 | Google Deep Mind | Gemma
149 | minimax-m1 | 1,363 | +/-4 | 35,233 | MiniMax | Apache 2.0
150 | o3-mini-high | 1,363 | +/-5 | 18,589 | OpenAI | Proprietary
151 | OpenAI o3-mini (high) | 1,362 | +/-5 | 16,977 | OpenAI | Proprietary
152 | nvidia-nemotron-3-super-120b-a12b | 1,361 | +/-7 | 7,419 | Nvidia | NVIDIA Open Model
153 | Gemini 2.0 Flash Experimental | 1,360 | +/-4 | 43,767 | DeepMind | Proprietary
154 | DeepSeek-V3 | 1,358 | +/-5 | 21,770 | DeepSeek-AI | DeepSeek
155 | Mistral-Small-3.2 | 1,357 | +/-5 | 17,716 | MistralAI | Apache 2.0
156 | grok-3-mini-beta | 1,357 | +/-5 | 22,724 | xAI | Proprietary
157 | intellect-3 | 1,357 | +/-8 | 5,337 | Prime Intellect | MIT
158 | C4AI Command A (202503) | 1,353 | +/-3 | 56,304 | CohereAI | CC-BY-NC-4.0
159 | GLM-4.5V | 1,353 | +/-8 | 4,965 | Zhipu AI | MIT
160 | Gemini 2.0 Flash-Lite | 1,353 | +/-4 | 24,955 | DeepMind | Proprietary
161 | GPT OSS 120B | 1,353 | +/-4 | 30,653 | OpenAI | Apache 2.0
162 | Gemini 1.5 Pro | 1,351 | +/-3 | 55,606 | Google Deep Mind | Proprietary
163 | amazon-nova-experimental-chat-10-20 | 1,350 | +/-6 | 11,479 | Amazon | Proprietary
164 | hunyuan-turbos-20250226 | 1,348 | +/-12 | 2,220 | Tencent | Proprietary
165 | Step3 | 1,348 | +/-7 | 6,551 | StepFunAI | Apache 2.0
166 | amazon-nova-experimental-chat-10-09 | 1,348 | +/-11 | 2,839 | Amazon | Proprietary
167 | o3-mini | 1,347 | +/-4 | 57,364 | OpenAI | Proprietary
168 | Qwen3-32B | 1,347 | +/-9 | 3,926 | Alibaba | Apache 2.0
169 | llama-3.1-nemotron-ultra-253b-v1 | 1,347 | +/-12 | 2,549 | Nvidia | Nvidia Open Model
170 | mercury-2 | 1,347 | +/-11 | 3,135 | Inception AI | Proprietary
171 | ling-flash-2.0 | 1,346 | +/-7 | 7,015 | InclusionAI | MIT
172 | MiniMax M2 | 1,346 | +/-8 | 6,871 | MiniMaxAI | Apache 2.0
173 | qwen-plus-0125 | 1,346 | +/-8 | 5,819 | Alibaba | Proprietary
174 | GPT-4o | 1,345 | +/-3 | 112,881 | OpenAI | Proprietary
175 | nvidia-llama-3.3-nemotron-super-49b-v1.5 | 1,343 | +/-10 | 3,346 | Nvidia | Nvidia Open
176 | glm-4-plus-0111 | 1,343 | +/-8 | 5,760 | Zhipu | Proprietary
177 | Claude 3.5 Sonnet | 1,342 | +/-3 | 82,419 | Anthropic | Proprietary
178 | Gemma 3 - 12B (IT) | 1,342 | +/-10 | 3,829 | Google Deep Mind | Gemma
179 | hunyuan-turbo-0110 | 1,340 | +/-12 | 2,290 | Tencent | Proprietary
180 | gpt-5-nano-high | 1,337 | +/-7 | 8,274 | OpenAI | Proprietary
181 | Nova 2 Lite | 1,337 | +/-6 | 12,250 | Amazon | Proprietary
182 | OpenAI o1-mini | 1,337 | +/-4 | 51,981 | OpenAI | Proprietary
183 | QwQ-32B | 1,336 | +/-4 | 25,401 | Alibaba | Apache 2.0
184 | Grok 2 | 1,335 | +/-4 | 63,498 | xAI | Proprietary
185 | gemini-advanced-0514 | 1,335 | +/-5 | 50,148 | Google | Proprietary
186 | GPT-4o | 1,334 | +/-4 | 45,499 | OpenAI | Proprietary
187 | llama-3.1-405b-instruct-bf16 | 1,334 | +/-4 | 41,375 | Meta | Llama 3.1 Community
188 | step-2-16k-exp-202412 | 1,334 | +/-9 | 4,833 | StepFun | Proprietary
189 | llama-3.1-405b-instruct-fp8 | 1,333 | +/-4 | 59,656 | Meta | Llama 3.1 Community
190 | olmo-3.1-32b-instruct | 1,330 | +/-6 | 12,240 | Ai2 | Apache 2.0
191 | molmo-2-8b | 1,329 | +/-21 | 802 | Ai2 | Apache 2.0
192 | yi-lightning | 1,328 | +/-5 | 27,332 | 01 AI | Proprietary
193 | llama-3.3-nemotron-49b-super-v1 | 1,327 | +/-12 | 2,218 | Nvidia | Nvidia
194 | Qwen3-30B-A3B | 1,327 | +/-5 | 26,502 | Alibaba | Apache 2.0
195 | llama-4-maverick-17b-128e-instruct | 1,327 | +/-4 | 39,996 | Meta | Llama 4
196 | hunyuan-large-2025-02-10 | 1,326 | +/-10 | 3,738 | Tencent | Proprietary
197 | gpt-4-turbo-2024-04-09 | 1,324 | +/-4 | 98,114 | OpenAI | Proprietary
198 | deepseek-v2.5-1210 | 1,323 | +/-8 | 6,795 | DeepSeek | DeepSeek
199 | Claude 3.5 Haiku | 1,323 | +/-3 | 70,017 | Anthropic | Proprietary
200 | Gemini 1.5 Pro | 1,323 | +/-4 | 79,138 | Google Deep Mind | Proprietary
201 | llama-4-scout-17b-16e-instruct | 1,322 | +/-5 | 30,310 | Meta | Llama
202 | gpt-4.1-nano-2025-04-14 | 1,322 | +/-8 | 6,103 | OpenAI | Proprietary
203 | Claude3-Opus | 1,321 | +/-3 | 194,909 | Anthropic | Proprietary
204 | ring-flash-2.0 | 1,321 | +/-7 | 7,156 | InclusionAI | MIT
205 | step-1o-turbo-202506 | 1,320 | +/-7 | 9,039 | StepFun | Proprietary
206 | glm-4-plus | 1,319 | +/-5 | 26,126 | Zhipu AI | Proprietary
207 | gemma-3n-e4b-it | 1,318 | +/-5 | 22,610 | Google | Gemma
208 | llama-3.3-70b-instruct | 1,318 | +/-3 | 54,748 | Meta | Llama-3.3
209 | qwen-max-0919 | 1,318 | +/-6 | 16,478 | Alibaba | Qwen
210 | gpt-4o-mini-2024-07-18 | 1,317 | +/-4 | 68,710 | OpenAI | Proprietary
211 | gpt-oss-20b | 1,317 | +/-6 | 10,637 | OpenAI | Apache 2.0
212 | nvidia-nemotron-3-nano-30b-a3b-bf16 | 1,317 | +/-6 | 15,530 | Nvidia | NVIDIA Open Model
213 | qwen2.5-plus-1127 | 1,315 | +/-6 | 10,187 | Alibaba | Proprietary
214 | athene-v2-chat | 1,314 | +/-5 | 24,739 | NexusFlow | NexusFlow
215 | mistral-large-2407 | 1,313 | +/-4 | 45,459 | Mistral | Mistral Research
216 | gpt-4-0125-preview | 1,312 | +/-4 | 93,439 | OpenAI | Proprietary
217 | gpt-4-1106-preview | 1,312 | +/-4 | 100,105 | OpenAI | Proprietary
218 | hunyuan-standard-2025-02-10 | 1,311 | +/-10 | 3,904 | Tencent | Proprietary
219 | gemini-1.5-flash-002 | 1,309 | +/-4 | 34,902 | Google | Proprietary
220 | grok-2-mini-2024-08-13 | 1,308 | +/-4 | 52,567 | xAI | Proprietary
221 | deepseek-v2.5 | 1,307 | +/-5 | 24,572 | DeepSeek | DeepSeek
222 | mercury | 1,306 | +/-14 | 1,958 | Inception AI | Proprietary
223 | athene-70b-0725 | 1,306 | +/-6 | 19,621 | NexusFlow | CC-BY-NC-4.0
224 | olmo-3-32b-think | 1,305 | +/-8 | 5,962 | Ai2 | Apache 2.0
225 | mistral-large-2411 | 1,305 | +/-4 | 28,073 | Mistral | MRL
226 | magistral-medium-2506 | 1,303 | +/-6 | 11,646 | Mistral | Proprietary
227 | gemma-3-4b-it | 1,303 | +/-9 | 4,171 | Google | Gemma
228 | mistral-small-3.1-24b-instruct-2503 | 1,303 | +/-5 | 33,231 | Mistral | Apache 2.0
229 | qwen2.5-72b-instruct | 1,302 | +/-4 | 39,406 | Alibaba | Qwen
230 | llama-3.1-nemotron-70b-instruct | 1,299 | +/-8 | 7,140 | Nvidia | Llama 3.1
231 | hunyuan-large-vision | 1,294 | +/-9 | 5,370 | Tencent | Proprietary
232 | llama-3.1-70b-instruct | 1,293 | +/-4 | 55,240 | Meta | Llama 3.1 Community
233 | amazon-nova-pro-v1.0 | 1,290 | +/-5 | 24,745 | Amazon | Proprietary
234 | jamba-1.5-large | 1,288 | +/-7 | 8,662 | AI21 Labs | Jamba Open
235 | gemma-2-27b-it | 1,288 | +/-3 | 75,754 | Google | Gemma license
236 | reka-core-20240904 | 1,287 | +/-7 | 7,312 | Reka AI | Proprietary
237 | ibm-granite-h-small | 1,287 | +/-8 | 5,679 | IBM | Apache 2.0
238 | gpt-4-0314 | 1,286 | +/-5 | 54,173 | OpenAI | Proprietary
239 | llama-3.1-tulu-3-70b | 1,286 | +/-10 | 2,846 | Ai2 | Llama 3.1
240 | llama-3.1-nemotron-51b-instruct | 1,285 | +/-10 | 3,749 | Nvidia | Llama 3.1
241 | gemini-1.5-flash-001 | 1,285 | +/-4 | 62,833 | Google | Proprietary
242 | olmo-3.1-32b-think | 1,285 | +/-7 | 8,512 | Ai2 | Apache 2.0
243 | claude-3-sonnet-20240229 | 1,280 | +/-4 | 109,284 | Anthropic | Proprietary
244 | gemma-2-9b-it-simpo | 1,279 | +/-7 | 10,072 | Princeton | MIT
245 | nemotron-4-340b-instruct | 1,276 | +/-5 | 19,659 | Nvidia | NVIDIA Open Model
246 | command-r-plus-08-2024 | 1,276 | +/-7 | 9,866 | Cohere | CC-BY-NC-4.0
247 | llama-3-70b-instruct | 1,275 | +/-4 | 156,876 | Meta | Llama 3 Community
248 | gpt-4-0613 | 1,274 | +/-4 | 88,723 | OpenAI | Proprietary
249 | mistral-small-24b-instruct-2501 | 1,274 | +/-6 | 14,681 | Mistral | Apache 2.0
250 | glm-4-0520 | 1,273 | +/-7 | 9,788 | Zhipu AI | Proprietary
251 | reka-flash-20240904 | 1,271 | +/-7 | 7,536 | Reka AI | Proprietary
252 | qwen2.5-coder-32b-instruct | 1,270 | +/-8 | 5,432 | Alibaba | Apache 2.0
253 | c4ai-aya-expanse-32b | 1,266 | +/-5 | 27,124 | Cohere | CC-BY-NC-4.0
254 | gemma-2-9b-it | 1,265 | +/-4 | 54,611 | Google | Gemma license
255 | deepseek-coder-v2 | 1,264 | +/-6 | 15,147 | DeepSeek | DeepSeek License
256 | command-r-plus | 1,261 | +/-4 | 77,554 | Cohere | CC-BY-NC-4.0
257 | qwen2-72b-instruct | 1,261 | +/-5 | 37,325 | Alibaba | Qianwen LICENSE
258 | claude-3-haiku-20240307 | 1,260 | +/-4 | 117,701 | Anthropic | Proprietary
259 | amazon-nova-lite-v1.0 | 1,260 | +/-5 | 19,372 | Amazon | Proprietary
260 | gemini-1.5-flash-8b-001 | 1,258 | +/-4 | 35,558 | Google | Proprietary
261 | Phi 4 - 14B | 1,256 | +/-5 | 24,126 | Microsoft Azure | MIT
262 | olmo-2-0325-32b-instruct | 1,251 | +/-11 | 3,334 | Ai2 | Apache-2.0
263 | command-r-08-2024 | 1,249 | +/-7 | 10,140 | Cohere | CC-BY-NC-4.0
264 | mistral-large-2402 | 1,241 | +/-5 | 62,436 | Mistral | Proprietary
265 | amazon-nova-micro-v1.0 | 1,240 | +/-5 | 19,364 | Amazon | Proprietary
266 | jamba-1.5-mini | 1,239 | +/-7 | 8,858 | AI21 Labs | Jamba Open
267 | ministral-8b-2410 | 1,237 | +/-9 | 4,781 | Mistral | MRL
268 | gemini-pro-dev-api | 1,235 | +/-7 | 18,354 | Google | Proprietary
269 | qwen1.5-110b-chat | 1,233 | +/-6 | 26,195 | Alibaba | Qianwen LICENSE
270 | hunyuan-standard-256k | 1,233 | +/-12 | 2,728 | Tencent | Proprietary
271 | reka-flash-21b-20240226-online | 1,232 | +/-7 | 15,450 | Reka AI | Proprietary
272 | qwen1.5-72b-chat | 1,232 | +/-5 | 39,302 | Alibaba | Qianwen LICENSE
273 | mixtral-8x22b-instruct-v0.1 | 1,228 | +/-5 | 51,416 | Mistral | Apache 2.0
274 | command-r | 1,226 | +/-5 | 54,036 | Cohere | CC-BY-NC-4.0
275 | reka-flash-21b-20240226 | 1,226 | +/-6 | 24,806 | Reka AI | Proprietary
276 | gpt-3.5-turbo-0125 | 1,223 | +/-5 | 66,207 | OpenAI | Proprietary
277 | llama-3-8b-instruct | 1,222 | +/-4 | 104,642 | Meta | Llama 3 Community
278 | c4ai-aya-expanse-8b | 1,222 | +/-7 | 9,818 | Cohere | CC-BY-NC-4.0
279 | mistral-medium | 1,222 | +/-6 | 34,550 | Mistral | Proprietary
280 | gemini-pro | 1,221 | +/-12 | 6,390 | Google | Proprietary
281 | llama-3.1-tulu-3-8b | 1,220 | +/-11 | 2,896 | Ai2 | Llama 3.1
282 | yi-1.5-34b-chat | 1,212 | +/-5 | 24,146 | 01 AI | Apache-2.0
283 | zephyr-orpo-141b-A35b-v0.1 | 1,212 | +/-11 | 4,652 | HuggingFace | Apache 2.0
284 | llama-3.1-8b-instruct | 1,211 | +/-4 | 49,605 | Meta | Llama 3.1 Community
285 | granite-3.1-8b-instruct | 1,207 | +/-11 | 3,090 | IBM | Apache 2.0
286 | qwen1.5-32b-chat | 1,203 | +/-6 | 21,741 | Alibaba | Qianwen LICENSE
287 | gpt-3.5-turbo-1106 | 1,202 | +/-9 | 16,619 | OpenAI | Proprietary
288 | gemma-2-2b-it | 1,199 | +/-4 | 46,616 | Google | Gemma license
289 | phi-3-medium-4k-instruct | 1,197 | +/-5 | 25,055 | Microsoft | MIT
290 | mixtral-8x7b-instruct-v0.1 | 1,196 | +/-4 | 73,503 | Mistral | Apache 2.0
291 | dbrx-instruct-preview | 1,194 | +/-6 | 32,191 | Databricks | DBRX LICENSE
292 | internlm2_5-20b-chat | 1,191 | +/-7 | 9,901 | InternLM | Other
293 | qwen1.5-14b-chat | 1,190 | +/-7 | 17,839 | Alibaba | Qianwen LICENSE
294 | wizardlm-70b | 1,184 | +/-9 | 8,214 | Microsoft | Llama 2 Community
295 | deepseek-llm-67b-chat | 1,183 | +/-12 | 4,932 | DeepSeek | DeepSeek License
296 | yi-34b-chat | 1,183 | +/-7 | 15,483 | 01 AI | Yi License
297 | openchat-3.5-0106 | 1,181 | +/-8 | 12,637 | OpenChat | Apache-2.0
298 | openchat-3.5 | 1,181 | +/-10 | 7,968 | OpenChat | Apache-2.0
299 | granite-3.0-8b-instruct | 1,181 | +/-9 | 6,638 | IBM | Apache 2.0
300 | gemma-1.1-7b-it | 1,180 | +/-6 | 23,893 | Google | Gemma license
301 | snowflake-arctic-instruct | 1,178 | +/-6 | 32,832 | Snowflake | Apache 2.0
302 | granite-3.1-2b-instruct | 1,178 | +/-11 | 3,188 | IBM | Apache 2.0
303 | tulu-2-dpo-70b | 1,177 | +/-10 | 6,535 | AllenAI/UW | AI2 ImpACT Low-risk
304 | openhermes-2.5-mistral-7b | 1,174 | +/-10 | 5,006 | NousResearch | Apache-2.0
305 | vicuna-33b | 1,172 | +/-6 | 22,479 | LMSYS | Non-commercial
306 | starling-lm-7b-beta | 1,171 | +/-7 | 16,056 | Nexusflow | Apache-2.0
307 | phi-3-small-8k-instruct | 1,170 | +/-6 | 17,766 | Microsoft | MIT
308 | llama-2-70b-chat | 1,170 | +/-6 | 38,492 | Meta | Llama 2 Community
309 | starling-lm-7b-alpha | 1,166 | +/-8 | 10,224 | UC Berkeley | CC-BY-NC-4.0
310 | llama-3.2-3b-instruct | 1,166 | +/-8 | 7,936 | Meta | Llama 3.2
311 | nous-hermes-2-mixtral-8x7b-dpo | 1,164 | +/-12 | 3,777 | NousResearch | Apache-2.0
312 | qwq-32b-preview | 1,156 | +/-12 | 3,231 | Alibaba | Apache 2.0
313 | granite-3.0-2b-instruct | 1,155 | +/-8 | 6,837 | IBM | Apache 2.0
314 | llama2-70b-steerlm-chat | 1,154 | +/-13 | 3,585 | Nvidia | Llama 2 Community
315 | solar-10.7b-instruct-v1.0 | 1,151 | +/-13 | 4,155 | Upstage AI | CC-BY-NC-4.0
316 | dolphin-2.2.1-mistral-7b | 1,151 | +/-15 | 1,679 | Cognitive Computations | Apache-2.0
317 | mpt-30b-chat | 1,149 | +/-12 | 2,572 | MosaicML | CC-BY-NC-SA-4.0
318 | mistral-7b-instruct-v0.2 | 1,148 | +/-7 | 19,402 | Mistral | Apache-2.0
319 | wizardlm-13b | 1,148 | +/-9 | 7,044 | Microsoft | Llama 2 Community
320 | falcon-180b-chat | 1,146 | +/-17 | 1,295 | TII | Falcon-180B TII License
321 | qwen1.5-7b-chat | 1,143 | +/-10 | 4,737 | Alibaba | Qianwen LICENSE
322 | phi-3-mini-4k-instruct-june-2024 | 1,142 | +/-6 | 12,297 | Microsoft | MIT
323 | llama-2-13b-chat | 1,141 | +/-7 | 19,174 | Meta | Llama 2 Community
324 | vicuna-13b | 1,140 | +/-7 | 19,367 | LMSYS | Llama 2 Community
325 | qwen-14b-chat | 1,137 | +/-11 | 4,964 | Alibaba | Qianwen LICENSE
326 | palm-2 | 1,136 | +/-9 | 8,554 | Google | Proprietary
327 | gemma-7b-it | 1,136 | +/-10 | 8,925 | Google | Gemma license
328 | codellama-34b-instruct | 1,136 | +/-9 | 7,366 | Meta | Llama 2 Community
329 | zephyr-7b-beta | 1,130 | +/-9 | 11,118 | HuggingFace | MIT
330 | phi-3-mini-128k-instruct | 1,128 | +/-7 | 20,685 | Microsoft | MIT
331 | phi-3-mini-4k-instruct | 1,127 | +/-6 | 20,118 | Microsoft | MIT
332 | guanaco-33b | 1,126 | +/-12 | 2,921 | UW | Non-commercial
333 | zephyr-7b-alpha | 1,126 | +/-16 | 1,785 | HuggingFace | MIT
334 | stripedhyena-nous-7b | 1,120 | +/-11 | 5,182 | Together AI | Apache 2.0
335 | codellama-70b-instruct | 1,118 | +/-18 | 1,143 | Meta | Llama 2 Community
336 | gemma-1.1-2b-it | 1,114 | +/-8 | 10,854 | Google | Gemma license
337 | vicuna-7b | 1,114 | +/-9 | 6,923 | LMSYS | Llama 2 Community
338 | smollm2-1.7b-instruct | 1,113 | +/-14 | 2,199 | HuggingFace | Apache 2.0
339 | llama-3.2-1b-instruct | 1,110 | +/-8 | 8,045 | Meta | Llama 3.2
340 | mistral-7b-instruct | 1,109 | +/-9 | 8,977 | Mistral | Apache 2.0
341 | llama-2-7b-chat | 1,107 | +/-7 | 14,148 | Meta | Llama 2 Community
342 | gemma-2b-it | 1,091 | +/-12 | 4,780 | Google | Gemma license
343 | qwen1.5-4b-chat | 1,089 | +/-9 | 7,597 | Alibaba | Qianwen LICENSE
344 | olmo-7b-instruct | 1,073 | +/-11 | 6,328 | Ai2 | Apache-2.0
345 | koala-13b | 1,069 | +/-10 | 6,965 | UC Berkeley | Non-commercial
346 | alpaca-13b | 1,067 | +/-11 | 5,745 | Stanford | Non-commercial
347 | gpt4all-13b-snoozy | 1,065 | +/-15 | 1,743 | Nomic AI | Non-commercial
348 | mpt-7b-chat | 1,061 | +/-12 | 3,924 | MosaicML | CC-BY-NC-SA-4.0
349 | chatglm3-6b | 1,055 | +/-12 | 4,658 | Tsinghua | Apache-2.0
350 | RWKV-4-Raven-14B | 1,040 | +/-11 | 4,845 | RWKV | Apache 2.0
351 | chatglm2-6b | 1,023 | +/-14 | 2,658 | Tsinghua | Apache-2.0
352 | oasst-pythia-12b | 1,021 | +/-11 | 6,310 | OpenAssistant | Apache 2.0
353 | chatglm-6b | 994 | +/-13 | 4,914 | Tsinghua | Non-commercial
354 | fastchat-t5-3b | 990 | +/-12 | 4,203 | LMSYS | Apache 2.0
355 | dolly-v2-12b | 979 | +/-14 | 3,412 | Databricks | MIT
356 | llama-13b | 972 | +/-16 | 2,391 | Meta | Non-commercial
357 | stablelm-tuned-alpha-7b | 952 | +/-13 | 3,287 | Stability AI | CC-BY-NC-SA-4.0

Data is for reference only. Official sources are authoritative. Click model names to view DataLearner model profiles.

FAQ

01

What is Text Generation Arena (LMArena)?

Text Generation Arena, formerly LMSYS Chatbot Arena, is one of the most widely followed anonymous LLM evaluation platforms. Users compare answers from two hidden models and vote for the better response; Elo-style scoring aggregates those votes into a dynamic leaderboard.

02

How is the Arena Elo score calculated?

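Arena Elo is adapted from chess rating systems. In the classic form, after each head-to-head comparison the preferred model gains rating points and the other model loses the same number, with the size of the change depending on the rating gap: beating a much higher-rated model moves the ratings more than beating a lower-rated one. The 95% confidence interval expresses the uncertainty of the estimate; it narrows as a model accumulates more comparison data.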
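The relationship between a rating gap and the expected outcome can be sketched with the standard chess Elo formulas. This is an illustrative sketch only: the 400-point scale and the update step `k=4.0` are assumptions, not values published on this page, and the function names are hypothetical.

```python
def expected_score(rating_a, rating_b, scale=400.0):
    """Probability that A is preferred over B under the standard Elo curve."""
    return 1.0 / (1.0 + 10.0 ** ((rating_b - rating_a) / scale))


def elo_update(rating_a, rating_b, a_won, k=4.0, scale=400.0):
    """Return both ratings after one comparison (k is an assumed step size)."""
    exp_a = expected_score(rating_a, rating_b, scale)
    delta = k * ((1.0 if a_won else 0.0) - exp_a)
    # The winner gains exactly what the loser gives up.
    return rating_a + delta, rating_b - delta


# A 28-point gap, such as 1,503 vs 1,475 in the table, implies only a
# modest edge: roughly a 54% expected preference rate under these assumptions.
print(round(expected_score(1503, 1475), 3))
```

Under these assumptions, even the gap between the top of the table and models a dozen places lower corresponds to a win rate only a few points above 50%.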

03

Why do some models have both Thinking and regular versions?

Some models offer an extended-thinking mode that spends more inference time reasoning before producing the final answer. This can improve scores on reasoning, math, and coding tasks, but usually increases latency and cost, so Arena tracks these variants separately.

04

How should I choose an LLM from this leaderboard?

Consider overall Elo, cost, language coverage, open-source availability, and latency. The top-ranked model is not always the best fit for every workflow.
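One way to see why the top rank is not decisive is to check which models have 95% intervals that overlap the leader's. The sketch below uses a handful of rows from the ranking table; the `Entry` type and `tied_with` helper are hypothetical names, and interval overlap is only a rough heuristic, not a formal significance test.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Entry:
    model: str
    score: int
    ci: int       # half-width of the 95% interval, in rating points
    license: str


# A few rows taken from the ranking table for illustration.
ENTRIES = [
    Entry("Opus 4.7 (thinking)", 1503, 6, "Proprietary"),
    Entry("Claude Opus 4.6 (thinking)", 1502, 5, "Proprietary"),
    Entry("Claude Opus 4.6", 1498, 5, "Proprietary"),
    Entry("Gemini 3.1 Pro Preview", 1492, 4, "Proprietary"),
    Entry("GLM 5.1", 1471, 6, "MIT"),
]


def intervals_overlap(a, b):
    """True when the two score intervals share at least one point."""
    return abs(a.score - b.score) <= a.ci + b.ci


def tied_with(entries, leader):
    """Models (other than the leader) whose interval overlaps the leader's."""
    return [e for e in entries if e is not leader and intervals_overlap(e, leader)]


for entry in tied_with(ENTRIES, ENTRIES[0]):
    print(entry.model, entry.score, f"+/-{entry.ci}")
```

With these rows, the two Claude Opus 4.6 variants overlap the leader while Gemini 3.1 Pro Preview falls just outside, so cost, latency, and licensing can reasonably decide between the overlapping models.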