
A knowledge platform focused on LLM benchmarking, datasets, and practical instruction with continuously updated capability maps.

© 2026 DataLearner AI. DataLearner curates industry data and case studies so researchers, enterprises, and developers can rely on trustworthy intelligence.


Text-to-Video Arena Leaderboard

The latest AI video generation leaderboard, based on anonymous user voting in Text-to-Video Arena. It covers Elo scores, confidence intervals, and vote counts for leading video models.

Top Model: Veo 3.1 Generate (Preview)
Top Score: 1,381
Model Count: 37
Data version: 2026-03-06
Data source: LMArena

About This Leaderboard

This leaderboard ranks AI text-to-video models by generation quality. Data comes from LMArena's Text-to-Video Arena track, evaluated through anonymous blind testing by real users.

Methodology Overview

Blind testing: Users submit text descriptions, two anonymous models generate videos, and users vote for the better result.

Elo scoring: Based on the Bradley-Terry model. Higher scores indicate stronger user preference for that model's video output.
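
As a rough illustration of how pairwise votes can be turned into Elo-style scores with a Bradley-Terry model, here is a minimal sketch. The function names (`fit_bradley_terry`, `to_elo`, `win_probability`) are hypothetical, and the 400-point base-10 scale with a 1000-point center is the conventional Elo choice assumed here, not something the leaderboard confirms. The actual LMArena pipeline may differ, for example in how it handles ties and computes confidence intervals.

```python
import math
from collections import defaultdict


def fit_bradley_terry(votes, iterations=200):
    """Estimate Bradley-Terry strengths from (winner, loser) pairs.

    Uses the classic minorization-maximization update. Assumes each vote
    names two distinct models and every model has at least one win.
    """
    wins = defaultdict(int)         # total wins per model
    pair_counts = defaultdict(int)  # number of comparisons per unordered pair
    models = set()
    for winner, loser in votes:
        wins[winner] += 1
        pair_counts[frozenset((winner, loser))] += 1
        models.update((winner, loser))

    strength = {m: 1.0 for m in models}
    for _ in range(iterations):
        updated = {}
        for m in models:
            denom = 0.0
            for pair, n in pair_counts.items():
                if m in pair:
                    (other,) = pair - {m}
                    denom += n / (strength[m] + strength[other])
            updated[m] = wins[m] / denom
        # Rescale so the geometric mean of the strengths stays at 1.
        log_mean = sum(math.log(s) for s in updated.values()) / len(updated)
        strength = {m: s / math.exp(log_mean) for m, s in updated.items()}
    return strength


def to_elo(strength, center=1000.0, scale=400.0):
    """Map strengths onto an Elo-like scale (assumed: base 10, 400 points)."""
    return {m: center + scale * math.log10(s) for m, s in strength.items()}


def win_probability(rating_a, rating_b, scale=400.0):
    """Expected probability that A is preferred over B under the same scale."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / scale))


if __name__ == "__main__":
    # Toy votes, invented for illustration only.
    toy_votes = [("A", "B")] * 3 + [("B", "A")] + [("B", "C")] * 2 + [("C", "A")]
    for model, rating in sorted(to_elo(fit_bradley_terry(toy_votes)).items()):
        print(model, round(rating))
```

Under these assumed scale conventions, a 14-point gap (such as 1,381 versus 1,367) corresponds to an expected preference rate of only about 52%, which is one reason the confidence intervals matter when comparing closely ranked models.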


Diverse generation scenarios: Prompts cover natural landscapes, human motion, creative animation, product showcases, and more.

DataLearner provides in-depth analysis on top of the raw data, linking leaderboard models to the DataLearner model database so you can quickly access model details, API pricing, benchmark scores, and more.

Ranking Table

Rank | Model | Score | 95% CI | Votes | Organization | License
1 | Veo 3.1 Generate (Preview) | 1,381 | +/-16 | 5,537 | Google DeepMind | Proprietary
2 | Veo 3.1 Fast (Preview) | 1,378 | +/-14 | 5,743 | Google DeepMind | Proprietary
3 | Veo 3.1 Generate (Preview) | 1,371 | +/-14 | 12,604 | Google DeepMind | Proprietary
4 | Sora 2 | 1,367 | +/-9 | 18,963 | OpenAI | Proprietary
5 | Veo 3.1 Fast (Preview) | 1,366 | +/-11 | 25,377 | Google DeepMind | Proprietary
6 | Grok Imagine 0.9 | 1,358 | +/-9 | 33,739 | xAI | Proprietary
7 | Veo 3.1 Fast (Preview) | 1,351 | +/-11 | 25,765 | Google DeepMind | Proprietary
8 | wan2.6-t2v | 1,347 | +/-17 | 6,446 | Alibaba | Proprietary
9 | Sora 2 | 1,342 | +/-8 | 25,157 | OpenAI | Proprietary
10 | Veo 3.1 Generate (Preview) | 1,341 | +/-12 | 19,331 | Google DeepMind | Proprietary
11 | Wan2.1-T2V-14B | 1,268 | +/-17 | 6,079 | Alibaba | Proprietary
12 | Veo 3.1 Generate (Preview) | 1,257 | +/-11 | 15,176 | Google DeepMind | Proprietary
13 | Seedance 2.0 | 1,255 | +/-8 | 31,616 | ByteDance Seed Team | Proprietary
14 | Veo 3.1 Fast (Preview) | 1,251 | +/-12 | 15,453 | Google DeepMind | Proprietary
15 | pixverse-v5.6 | 1,228 | +/-14 | 2,275 | Pixverse | Proprietary
16 | Kling 2.5 Turbo | 1,221 | +/-17 | 2,052 | Kunlun Wanwei | Proprietary
17 | Kling 2.5 Turbo | 1,219 | +/-8 | 38,740 | Kunlun Wanwei | Proprietary
18 | runway-gen-4.5 | 1,214 | +/-11 | 3,932 | Runway | Proprietary
19 | Kling 2.5 Turbo | 1,208 | +/-27 | 1,198 | Kunlun Wanwei | Proprietary
20 | ray-3 | 1,204 | +/-23 | 1,057 | Luma AI | Proprietary
21 | Hailuo 2.3 | 1,200 | +/-12 | 9,879 | MiniMaxAI | Proprietary
22 | Hailuo 2.3 | 1,196 | +/-8 | 26,762 | MiniMaxAI | Proprietary
23 | Seedance 2.0 | 1,192 | +/-11 | 12,882 | ByteDance Seed Team | Proprietary
24 | Hailuo 2.3 | 1,181 | +/-12 | 9,931 | MiniMaxAI | Proprietary
25 | p-video | 1,180 | +/-15 | 3,589 | Pruna | Proprietary
26 | kandinsky-5.0-t2v-pro | 1,179 | +/-21 | 1,886 | Kandinsky | MIT
27 | Hunyuan-A13B-Instruct | 1,171 | +/-16 | 4,097 | Tencent AI Lab | tencent-hunyuan-community
28 | Kling 2.5 Turbo | 1,168 | +/-9 | 14,512 | Kunlun Wanwei | Proprietary
29 | Veo 3.1 Generate (Preview) | 1,166 | +/-16 | 7,098 | Google DeepMind | Proprietary
30 | Wan2.1-T2V-14B | 1,130 | +/-15 | 11,158 | Alibaba | Apache 2.0
31 | ltx-2-19b | 1,122 | +/-10 | 21,120 | lightricks | ltx-2-community-license-agreement
32 | Seedance 2.0 | 1,114 | +/-9 | 16,703 | ByteDance Seed Team | Proprietary
33 | kandinsky-5.0-t2v-lite | 1,112 | +/-18 | 1,353 | Kandinsky | MIT
34 | sora | 1,071 | +/-14 | 4,516 | OpenAI | Proprietary
35 | ray2 | 1,066 | +/-17 | 5,609 | Luma AI | Proprietary
36 | pika-v2.2 | 1,011 | +/-15 | 6,495 | Pika | Proprietary
37 | mochi-v1 | 999 | +/-16 | 6,676 | Genmo AI | Apache 2.0

Data is for reference only. Official sources are authoritative.

2026-03 Market Signals

Current Best (SOTA)

  • 01 Veo 3.1 Audio 1080p
  • 02 Veo 3.1 Fast-Audio 1080p
  • 03 Sora-2-Pro

Best China Model

  • Wan2.6-T2V
  • Seedance-V1.5-Pro
  • Kling-2.6-Pro

Best Open Model

  • Wan-V2.2-A14B
  • Kandinsky-5.0-T2V-Pro
  • Mochi-V1

FAQ

How does Text-to-Video Arena rank models?
Rankings are based on side-by-side anonymous votes. A user enters a prompt, compares the outputs from two hidden models, and chooses the better video. Elo-style scoring then aggregates those comparisons into a leaderboard.
What is audio-video sync, and why does it matter?
Audio-video sync means generated sound effects or speech match the motion and timing in the video. It matters because synchronized audio can make generated clips usable with less post-production work.
What use cases are text-to-video models good for?
Common uses include short-form video creation, marketing assets, e-commerce product clips, storyboarding, game cinematics, and educational demos.
Which models support the longest generation length?
Maximum generation length changes quickly with product tier and release. In practice, check the current model documentation, and compare both the maximum duration and how well quality holds up across longer clips.