DataLearner 标志DataLearnerAI
最新AI资讯
大模型排行榜
大模型评测基准
大模型列表
大模型对比
资源中心
工具
语言中文
DataLearner 标志DataLearner AI

专注大模型评测、数据资源与实践教学的知识平台,持续更新可落地的 AI 能力图谱。

产品

  • 评测榜单
  • 模型对比
  • 数据资源

资源

  • 部署教程
  • 原创内容
  • 工具导航

关于

  • 关于我们
  • 隐私政策
  • 数据收集方法
  • 联系我们

© 2026 DataLearner AI. DataLearner 持续整合行业数据与案例,为科研、企业与开发者提供可靠的大模型情报与实践指南。

隐私政策服务条款
首页综合排行榜Arcada Labs Code Categories Arena 代码能力排行榜

Arcada Labs Code Categories Arena 代码能力排行榜

基于 Arcada Labs Code Categories Arena 用户匿名投票的最新AI大模型代码能力排行榜,通过 Bradley-Terry 模型对 Website、UI Component、Game Dev、Data Visualization 等代码子类别进行综合评分与排名。

榜首模型

Claude Fable 5

最高得分

1352.00

模型数量

129

数据版本

2026年06月13日

数据来源: Arcada Labs

来源:全部国产模型
榜单历史快照月份:

排名总表

排名模型名称得分95% CI投票数机构许可证
AnthropicClaude Fable 5Anthropic1352.00+/-12.13,528AnthropicProprietary
AnthropicClaude Opus 4.6Anthropic1343.00+/-5.717,547AnthropicProprietary
AnthropicOpus 4.7 (thinking)Anthropic1339.00+/-7.59,695AnthropicProprietary
4AnthropicClaude Opus 4.6 (thinking)Anthropic1337.00+/-6.214,857AnthropicProprietary
5智谱GLM 5.1智谱AI1332.00+/-105,306智谱AIOpen Source
6Moonshot AIKimi K2.6Moonshot AI1332.00+/-5.419,693Moonshot AIOpen Source
7智谱GLM-5-Turbo智谱AI1326.00+/-5.122,226智谱AIProprietary
8AnthropicOpus 4.7Anthropic1325.00+/-6.513,174AnthropicProprietary
9AnthropicClaude Sonnet 4.6Anthropic1325.00+/-5.816,730AnthropicProprietary
10XIMiMo V2.5 ProXiaomi1323.00+/-11.63,820XiaomiOpen Source
11MiniMaxMiniMax M3MiniMax1315.00+/-9.25,954MiniMaxOpen Source
12阿里Qwen3.7 Max阿里巴巴1312.00+/-7.49,699阿里巴巴Proprietary
13XIMiMo V2.5Xiaomi1305.00+/-4.825,579XiaomiOpen Source
14FAMuse SparkFacebook AI研究实验室1303.00+/-10.94,249Facebook AI研究实验室Proprietary
15Google Deep MindGemini 3.5 FlashGoogle Deep Mind1299.00+/-7.78,856Google Deep MindProprietary
16OpenAIGPT-5.5OpenAI1299.00+/-7.110,232OpenAIProprietary
17DeepSeek-AIDeepSeek-V4-ProDeepSeek-AI1297.00+/-6.612,237DeepSeek-AIOpen Source
18智谱GLM-5智谱AI1297.00+/-440,865智谱AIOpen Source
19AnthropicOpus 4.5Anthropic1293.00+/-4.429,814AnthropicProprietary
20Google Deep MindGemini 3.1 Pro PreviewGoogle Deep Mind1292.00+/-4.923,843Google Deep MindProprietary
21Moonshot AIKimi K2.5 (thinking)Moonshot AI1288.00+/-4.235,262Moonshot AIOpen Source
22AnthropicClaude Opus 4.8Anthropic1282.00+/-7.78,538AnthropicProprietary
23MiniMaxAIMiniMax-M2.7MiniMaxAI1282.00+/-4.726,278MiniMaxAIOpen Source
24智谱GLM-5V-Turbo智谱AI1280.00+/-4.726,151智谱AIOpen Source
25Google Deep MindGemini 3.1 Pro PreviewGoogle Deep Mind1279.00+/-4.528,225Google Deep MindProprietary
26阿里Qwen 3.6 Plus Preview阿里巴巴1278.00+/-5.220,906阿里巴巴Proprietary
27智谱GLM-4.7智谱AI1269.00+/-3.842,337智谱AIOpen Source
28xAIGrok 4.20 Beta ReasoningxAI1269.00+/-5.220,044xAIProprietary
29OpenAIGPT-5.4 (Design Skill, Medium)OpenAI1266.00+/-8.17,633OpenAIProprietary
30DeepSeek-AIDeepSeek-V4-FlashDeepSeek-AI1264.00+/-5.319,662DeepSeek-AIOpen Source
31OpenAIGPT-5.4 (medium)OpenAI1261.00+/-5.915,138OpenAIProprietary
32MiniMaxAIMiniMax M2.5MiniMaxAI1258.00+/-6.711,504MiniMaxAIOpen Source
33xAIGrok 4.3 BetaxAI1252.00+/-614,749xAIProprietary
34xAIGrok 4.20 BetaxAI1249.00+/-5.120,935xAIProprietary
35MiniMaxAIM2.1MiniMaxAI1242.00+/-5.120,803MiniMaxAIOpen Source
36Google Deep MindGemini 3.0 FlashGoogle Deep Mind1241.00+/-10.64,414Google Deep MindProprietary
37AnthropicClaude Sonnet 4.5 (thinking)Anthropic1234.00+/-4.134,271AnthropicProprietary
38AnthropicClaude Sonnet 4.5Anthropic1233.00+/-4.134,958AnthropicProprietary
39阿里Qwen3.5-397B-A17B阿里巴巴1231.00+/-7.98,129阿里巴巴Open Source
40OpenAIGPT-5.4 (low)OpenAI1230.00+/-5.616,972OpenAIProprietary
41OpenAIGPT-5.4 (None)OpenAI1230.00+/-5.319,064OpenAIProprietary
42智谱GLM-4.7-Flash智谱AI1229.00+/-6.611,706智谱AIOpen Source
43AnthropicClaude Sonnet 3.7Anthropic1228.00+/-5.915,245AnthropicProprietary
44DeepSeek-AIDeepSeek-V3.1 (thinking)DeepSeek-AI1227.00+/-5.716,258DeepSeek-AIOpen Source
45AnthropicOpus 4.1 (thinking)Anthropic1223.00+/-5.815,677AnthropicProprietary
46OpenAIGPT-5.1 (high)OpenAI1223.00+/-5.716,057OpenAIProprietary
47DeepSeek-AIDeepSeek V3.2-ExpDeepSeek-AI1222.00+/-5.219,490DeepSeek-AIOpen Source
48OpenAIGPT-5.2 (None)OpenAI1221.00+/-4.625,824OpenAIProprietary
49OpenAIGPT-5.2 (medium)OpenAI1221.00+/-4.724,671OpenAIProprietary
50OpenAIGPT-5 (high)OpenAI1220.00+/-6.213,397OpenAIProprietary
51阿里Qwen3.5 Plus (0215)阿里巴巴1219.00+/-5.318,993阿里巴巴Proprietary
52DeepSeek-AIDeepSeek V3.2DeepSeek-AI1218.00+/-4.824,314DeepSeek-AIOpen Source
53StepFunStep 3.7 FlashStepFun1218.00+/-8.47,214StepFunOpen Source
54智谱GLM-4.5智谱AI1217.00+/-5.219,637智谱AIOpen Source
55智谱GLM-4.6智谱AI1217.00+/-5.616,911智谱AIOpen Source
56OpenAIGPT-5 (minimal)OpenAI1217.00+/-4.233,232OpenAIProprietary
57OpenAIGPT-5.2 (low)OpenAI1217.00+/-4.625,745OpenAIProprietary
58AnthropicOpus 4.1Anthropic1216.00+/-4.134,520AnthropicProprietary
59OpenAIGPT-5.1 (medium)OpenAI1213.00+/-521,291OpenAIProprietary
60AnthropicClaude Opus 4Anthropic1212.00+/-5.616,669AnthropicProprietary
61OpenAIGPT-5.1 (low)OpenAI1207.00+/-4.922,159OpenAIProprietary
62XIMiMo-V2-FlashXiaomi1207.00+/-4.134,555XiaomiOpen Source
63Google Deep MindGemini 2.5-ProGoogle Deep Mind1205.00+/-8.57,044Google Deep MindProprietary
64OpenAIGPT-5.1 CodexOpenAI1202.00+/-16.41,807OpenAIProprietary
65OpenAIGPT-5.1 (None)OpenAI1202.00+/-4.922,276OpenAIProprietary
66OpenAIGPT-5.2 (high)OpenAI1201.00+/-10.84,167OpenAIProprietary
67OpenAIGPT-5.3 CodexOpenAI1196.00+/-5.815,763OpenAIProprietary
68阿里Qwen3-Coder-480B-A35B阿里巴巴1194.00+/-16.31,958阿里巴巴Open Source
69AnthropicClaude Sonnet 4Anthropic1193.00+/-5.517,532AnthropicProprietary
70MistralAIMistral Large 3MistralAI1193.00+/-4.330,809MistralAIOpen Source
71DeepSeek-AIDeepSeek-R1-0528DeepSeek-AI1190.00+/-5.417,944DeepSeek-AIOpen Source
72智谱GLM-4.5-Air智谱AI1189.00+/-5.517,256智谱AIOpen Source
73AnthropicClaude Sonnet 4 (thinking)Anthropic1188.00+/-5.716,227AnthropicProprietary
74MiniMaxAIMiniMax M2MiniMaxAI1186.00+/-6.810,828MiniMaxAIOpen Source
75DEAesCoder-4BDesignFlow1176.00+/-3.939,734DesignFlowOpen Source
76MistralAIMistral Medium 3.5MistralAI1174.00+/-710,885MistralAIOpen Source
77MistralMistral Medium 3.1 (2508)Mistral1172.00+/-4.527,998MistralProprietary
78ARTrinity Large ThinkingArcee AI1168.00+/-6.512,815Arcee AIOpen Source
79AnthropicHaiku 4.5Anthropic1166.00+/-435,968AnthropicProprietary
80OpenAIGPT-5-miniOpenAI1166.00+/-4.233,066OpenAIProprietary
81DeepSeek-AIDeepSeek-V3.1DeepSeek-AI1163.00+/-5.120,278DeepSeek-AIOpen Source
82阿里Qwen3-Max-Thinking阿里巴巴1161.00+/-4.233,787阿里巴巴Proprietary
83DeepSeek-AIDeepSeek-V3-0324DeepSeek-AI1160.00+/-5.219,257DeepSeek-AIOpen Source
84PRPrime Intellect: INTELLECT-3Prime Intellect1158.00+/-4.331,318Prime IntellectOpen Source
85Google Deep MindGemini 2.5 Flash-Preview-09-2025Google Deep Mind1156.00+/-5.219,299Google Deep MindProprietary
86xAIGrok 4 FastxAI1152.00+/-437,227xAIProprietary
87Moonshot AIKimi K2 0905Moonshot AI1149.00+/-17.91,504Moonshot AIOpen Source
88OpenAIGPT-5.1 Codex MiniOpenAI1145.00+/-4.233,970OpenAIProprietary
89xAIGrok 4.1 FastxAI1144.00+/-4.233,893xAIProprietary
90xAIGrok 4.1 Fast (reasoning)xAI1139.00+/-4.331,593xAIProprietary
91OpenAIGPT-5-NanoOpenAI1136.00+/-8.66,710OpenAIProprietary
92Moonshot AIKimi K2 Turbo PreviewMoonshot AI1135.00+/-15.22,094Moonshot AIOpen Source
93Google Deep MindGemini 2.5 Flash-Lite-Preview-09-2025Google Deep Mind1133.00+/-8.56,860Google Deep MindProprietary
94GoogleGemini 3.1 Flash-Lite PreviewGoogle1123.00+/-523,509GoogleProprietary
95Microsoft AzurePhi-3-medium 14B-previewMicrosoft Azure1121.00+/-8.96,396Microsoft AzureProprietary
96MistralAIMinistral 3 14BMistralAI1117.00+/-14.42,379MistralAIOpen Source
97Google Deep MindGemini 2.5 FlashGoogle Deep Mind1111.00+/-8.56,960Google Deep MindProprietary
98REReve v1.5Reve AI1108.00+/-6.911,081Reve AIProprietary
99MistralAIMinistral 3 8BMistralAI1105.00+/-14.32,427MistralAIOpen Source
100xAIGrok 3xAI1104.00+/-4.626,860xAIProprietary
101xAIGrok 4 Fast (reasoning)xAI1103.00+/-4.137,880xAIProprietary
102阿里Qwen3-235B-A22B-2507阿里巴巴1090.00+/-8.66,932阿里巴巴Open Source
103Moonshot AIKimi K2Moonshot AI1085.00+/-19.41,352Moonshot AIOpen Source
104MistralMagistral Medium 1.2 (2509)Mistral1085.00+/-9.45,851MistralProprietary
105AlibabaQwen3-235B-A22B-Thinking-2507Alibaba1084.00+/-9.16,169AlibabaOpen Source
106OpenAIGPT-4.1OpenAI1077.00+/-17.31,747OpenAIProprietary
107OpenAIOpenAI o3OpenAI1071.00+/-19.51,365OpenAIProprietary
108xAIGrok 4xAI1068.00+/-4.923,998xAIProprietary
109MistralAIDevstral MediumMistralAI1064.00+/-8.57,158MistralAIProprietary
110MistralMinistral 3 3B (2512)Mistral1062.00+/-13.52,852MistralOpen Source
111MistralCodestral 2508Mistral1059.00+/-8.86,745MistralProprietary
112阿里Qwen3-235B-A22B阿里巴巴1054.00+/-10.15,154阿里巴巴Open Source
113xAIGrok Code Fast 1xAI1050.00+/-11.14,295xAIProprietary
114OpenAIGPT-4.1 miniOpenAI1045.00+/-18.31,566OpenAIProprietary
115MistralMagistral Small 1.2 (2509)Mistral1037.00+/-9.26,448MistralOpen Source
116OpenAIOpenAI o4 - miniOpenAI1027.00+/-16.22,011OpenAIProprietary
117ALOlmo 3.1 32B ThinkAllen AI1026.00+/-6.316,162Allen AIOpen Source
118OpenAIGPT OSS 120BOpenAI1015.00+/-10.35,268OpenAIOpen Source
119OpenAIGPT-4.1 nanoOpenAI1014.00+/-16.81,901OpenAIProprietary
120阿里Qwen3-30B-A3B阿里巴巴993.00+/-14.52,575阿里巴巴Open Source
121xAIGrok 3 minixAI982.00+/-8.77,626xAIProprietary
122NVIDIALlama 3.1 Nemotron Ultra 253BNVIDIA981.00+/-13.83,172NVIDIAOpen Source
123MistralAIMistral-Small-3.2MistralAI958.00+/-20.81,243MistralAIOpen Source
124FALlama 4 MaverickFacebook AI研究实验室931.00+/-18.41,678Facebook AI研究实验室Open Source
125MistralMistral Large 2.1 (2411)Mistral915.00+/-211,317MistralProprietary
126OpenAIGPT-4oOpenAI912.00+/-18.11,780OpenAIProprietary
127MistralCodestral 2 (2501)Mistral885.00+/-20.61,444MistralOpen Source
128MistralAIDevstral Small 1.1MistralAI859.00+/-22.51,250MistralAIOpen Source
129FALlama 4 ScoutFacebook AI研究实验室841.00+/-22.61,275Facebook AI研究实验室Open Source

数据仅供参考,以官方来源为准。模型名称旁的链接可跳转到 DataLearner 模型详情页。

关于本榜单

本榜单数据来源于Design Arena,由 Y Combinator 支持的 Arcada Labs 开发,是专注于评测 AI 设计代码生成能力的众包匿名对战平台。

与 LMArena 评测通用文本和编程能力不同,Design Arena 的代码榜专门考察模型生成具有视觉呈现效果的前端代码的能力。平台将代码任务细分为 Website、UI 组件、游戏开发、数据可视化、SVG、Web App、移动端等多个子类别,每个子类别均有独立排行。

本页展示的是 Code Categories 综合榜,即将所有子类别的用户投票混合汇总后,统一用 Bradley-Terry 模型(类 Elo 算法)计算出的综合排名。每票等权,不对各子类别做加权处理,因此投票量较大的子类别(如 Website)对综合分数的影响更大。得分越高,代表模型在设计代码生成场景下的综合人类偏好越强。

常见问题 (FAQ)

01

什么是 Arcada Labs Code Categories Arena?

Arcada Labs Code Categories Arena 是专注于设计代码生成能力的匿名评测平台,覆盖 Website、UI 组件、游戏开发、数据可视化等多个代码生成子类别,并将投票汇总为综合榜单。

02

Arcada Code Arena 与 LMArena Coding Arena 有什么区别?

LMArena Coding Arena 主要评测通用编程能力,例如代码生成、调试和算法实现;Arcada Code Arena 专注于具有视觉呈现效果的前端设计代码,例如 HTML 页面、交互 UI、图表、SVG 和原型。

03

排名方法论是什么?

Arcada Labs 将各代码子类别的原始投票混合后运行 Bradley-Terry 模型。每票等权,不按子类别单独加权,因此投票量较大的子类别会对综合分数产生更大影响。

04

哪类模型在设计代码场景表现更好?

具备强视觉理解和前端代码生成能力的大模型通常表现更好。针对 UI 和代码生成优化的专项模型,在布局、交互和视觉细节任务上也可能有突出表现。