DataLearner logoDataLearnerAI
Latest AI Insights
Model Evaluations
Model Directory
Model Comparison
Resource Center
Tools

加载中...

DataLearner logoDataLearner AI

A knowledge platform focused on LLM benchmarking, datasets, and practical instruction with continuously updated capability maps.

产品

  • Leaderboards
  • 模型对比
  • Datasets

资源

  • Tutorials
  • Editorial
  • Tool directory

关于

  • 关于我们
  • 隐私政策
  • 数据收集方法
  • 联系我们

© 2026 DataLearner AI. DataLearner curates industry data and case studies so researchers, enterprises, and developers can rely on trustworthy intelligence.

隐私政策服务条款
Evaluation OverviewText Generation Arena 文本生成模型排行榜

LMArena Tracks

Text GenerationImage EditText-to-VideoImage-to-VideoText-to-Image

Text Generation Arena 文本生成模型排行榜

基于 Text Generation Arena 用户匿名投票的最新AI文本生成模型排行榜,涵盖各模型的 Elo 得分、95% 置信区间、投票量、机构与许可证。

Top Model

claude-opus-4-6-thinking

Top Score

1,507

Model Count

60

Data version

2026年02月16日

Data source: LM Arena

About This Leaderboard

This leaderboard ranks the strongest AI models for text generation. Data comes from LMArena (formerly LMSYS Chatbot Arena), the world's largest crowdsourced AI evaluation platform. Users chat with two anonymous models side-by-side and vote for the better response — rankings are determined entirely by real user preferences, not lab benchmarks.

Methodology Overview

Blind testing: Users chat with two anonymous models and vote based on response quality, eliminating brand bias.

Elo scoring: Using the Bradley-Terry model (adapted from chess Elo ratings) to calculate each model's strength score from battle outcomes. Higher scores mean users more frequently prefer that model.

Broad scenario coverage: Testing spans coding, creative writing, math reasoning, Q&A, role-playing, and more.

DataLearner provides in-depth analysis on top of the raw data, linking leaderboard models to the DataLearner model database so you can quickly access model details, API pricing, benchmark scores, and more.

Text Generation Elo Score Ranking

Top 10

Chart Source: DataLearnerAI · Data Source: LMArena

Ranking Table

RankModelScore95% CIVotesOrganizationLicense
1claude-opus-4-6-thinking1,507+94,650AnthropicProprietary
2claude-opus-4-61,504+85,427AnthropicProprietary
3gemini-3-pro1,486+436,238GoogleProprietary
4grok-4.1-thinking1,475+435,770xAIProprietary
5gemini-3-flash1,473+526,986GoogleProprietary
6dola-seed-2.0-preview1,473+103,154BytedanceProprietary
7claude-opus-4-5-20251101-thinking-32k1,471+528,374AnthropicProprietary
8claude-opus-4-5-202511011,467+433,214AnthropicProprietary
9grok-4.11,463+439,883xAIProprietary
10gemini-3-flash (thinking-minimal)1,462+518,355GoogleProprietary
11gpt-5.1-high1,458+432,297OpenAIProprietary
12glm-51,455+94,643ZaiMIT
13ernie-5.0-01101,453+611,982BaiduProprietary
14claude-sonnet-4-5-20250929-thinking-32k1,450+446,773AnthropicProprietary
15claude-sonnet-4-5-202509291,450+444,565AnthropicProprietary
16gemini-2.5-pro1,449+395,526GoogleProprietary
17ernie-5.0-preview-12031,449+79,744BaiduProprietary
18claude-opus-4-1-20250805-thinking-16k1,449+449,819AnthropicProprietary
19kimi-k2.5-thinking1,448+79,050MoonshotModified MIT
20claude-opus-4-1-202508051,445+375,773AnthropicProprietary
21gpt-4.5-preview-2025-02-271,444+614,549OpenAIProprietary
22chatgpt-4o-latest-202503261,442+383,193OpenAIProprietary
23glm-4.71,441+611,971ZaiMIT
24gpt-5.2-high1,438+617,088OpenAIProprietary
25kimi-k2.5-instant1,438+95,007MoonshotModified MIT
26gpt-5.21,438+613,795OpenAIProprietary
27gpt-5.11,437+434,522OpenAIProprietary
28gpt-5-high1,434+532,559OpenAIProprietary
29qwen3-max-preview1,434+527,763AlibabaProprietary
30o3-2025-04-161,432+461,272OpenAIProprietary
31grok-4.1-fast-reasoning1,431+429,040xAIProprietary
32kimi-k2-thinking-turbo1,429+434,127MoonshotModified MIT
33gpt-5-chat1,426+431,753OpenAIProprietary
34glm-4.61,425+435,242ZaiMIT
35qwen3-max-2025-09-231,425+69,203AlibabaProprietary
36claude-opus-4-20250514-thinking-16k1,424+437,930AnthropicProprietary
37deepseek-v3.2-exp-thinking1,423+78,981DeepSeekMIT
38deepseek-v3.2-exp1,423+611,721DeepSeekMIT
39qwen3-235b-a22b-instruct-25071,423+369,847AlibabaApache 2.0
40grok-4-fast-chat1,422+86,983xAIProprietary
41deepseek-v3.2-thinking1,420+523,731DeepSeekMIT
42deepseek-v3.21,420+528,747DeepSeekMIT
43deepseek-r1-05281,419+619,281DeepSeekMIT
44ernie-5.0-preview-10221,419+94,594BaiduProprietary
45deepseek-v3.11,418+615,269DeepSeekMIT
46kimi-k2-0905-preview1,417+611,959MoonshotModified MIT
47deepseek-v3.1-thinking1,417+711,963DeepSeekMIT
48kimi-k2-0711-preview1,417+528,632MoonshotModified MIT
49deepseek-v3.1-terminus1,416+103,757DeepSeekMIT
50deepseek-v3.1-terminus-thinking1,416+103,547DeepSeekMIT
51qwen3-vl-235b-a22b-instruct1,415+611,653AlibabaApache 2.0
52mistral-large-31,414+524,945MistralApache 2.0
53gpt-4.1-2025-04-141,413+452,121OpenAIProprietary
54claude-opus-4-202505141,413+445,522AnthropicProprietary
55mistral-medium-25081,411+363,710MistralProprietary
56grok-3-preview-02-241,411+433,966xAIProprietary
57gemini-2.5-flash1,411+394,795GoogleProprietary
58glm-4.51,410+524,751ZaiMIT
59grok-4-07091,410+441,993xAIProprietary
60claude-haiku-4-5-202510011,406+445,273AnthropicProprietary

Data is for reference only. Official sources are authoritative. Click model names to view DataLearner model profiles.