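LMArena Coding Arena Coding Leaderboard
The latest leaderboard of AI large language models' coding ability, based on anonymous user votes in the LMArena Coding Arena. It lists each model's Elo score, 95% confidence interval, vote count, organization, and license.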
Top model: Claude Opus 4.6 (thinking)
Highest score: 1553.00
Number of models: 360
Data version: June 5, 2026
Data source: LM Arena
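About this leaderboard
This leaderboard ranks current AI large language models by their performance on coding tasks. The data comes from the Coding track of LMArena (formerly LMSYS Chatbot Arena), which evaluates models through anonymous, blind votes cast by real users.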
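Evaluation method overview
Anonymous blind testing: after a user submits a programming question, two models with hidden identities each produce a code answer, and the user votes for the better one. Hiding the model names removes brand bias.
Elo scoring: Elo scores are computed with the Bradley-Terry model. A higher score means users choose that model's code answers more often.
Coverage of many programming scenarios: code generation, bug fixing, algorithm implementation, code explanation, and other common real-world programming tasks.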
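The scoring step above can be illustrated with a minimal sketch of fitting a Bradley-Terry model to pairwise votes and mapping the result onto an Elo-like scale. This is not LMArena's actual pipeline: the model names, the toy vote data, the `scale=400` and `base=1000` constants, and the simple iterative (minorization-maximization) fit are all assumptions chosen for illustration.

```python
import math
from collections import defaultdict


def fit_bradley_terry(battles, iterations=200):
    """Estimate Bradley-Terry strengths from (winner, loser) pairs.

    Uses the classic iterative MM update. Assumes every model has at
    least one win and that no battle pairs a model with itself.
    """
    wins = defaultdict(float)         # total wins per model
    pair_counts = defaultdict(float)  # battles per unordered pair
    models = set()
    for winner, loser in battles:
        wins[winner] += 1.0
        pair_counts[frozenset((winner, loser))] += 1.0
        models.update((winner, loser))

    strength = {m: 1.0 for m in models}
    for _ in range(iterations):
        updated = {}
        for m in models:
            denom = 0.0
            for pair, n in pair_counts.items():
                if m in pair:
                    (other,) = pair - {m}
                    denom += n / (strength[m] + strength[other])
            updated[m] = wins[m] / denom
        # Strengths are only defined up to a constant factor, so pin
        # the geometric mean to 1 after every sweep.
        log_mean = sum(math.log(s) for s in updated.values()) / len(updated)
        strength = {m: s / math.exp(log_mean) for m, s in updated.items()}
    return strength


def to_elo_scale(strength, scale=400.0, base=1000.0):
    """Map strengths to an Elo-like scale (scale and base are assumed)."""
    return {m: base + scale * math.log10(s) for m, s in strength.items()}


# Hypothetical votes: each tuple is (winner, loser) from one blind battle.
toy_battles = (
    [("model_a", "model_b")] * 7 + [("model_b", "model_a")] * 3
    + [("model_b", "model_c")] * 6 + [("model_c", "model_b")] * 4
    + [("model_a", "model_c")] * 8 + [("model_c", "model_a")] * 2
)
ratings = to_elo_scale(fit_bradley_terry(toy_battles))
for name, score in sorted(ratings.items(), key=lambda kv: -kv[1]):
    print(f"{name}: {score:.1f}")
```

Because only differences between scores carry meaning in this model, the choice of `base` shifts every rating equally and does not affect the ranking.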
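DataLearner adds Chinese-language interpretation and in-depth analysis on top of the raw data, and links each model on the leaderboard to the DataLearner model database, so you can view model details, API pricing, benchmark scores, and other information in one click.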
Full rankings
| Rank | Model | Score | 95% CI | Votes | Organization | License |
|---|---|---|---|---|---|---|
| 1 | Claude Opus 4.6 (thinking) | 1553.00 | +/-7 | 9,727 | Anthropic | Proprietary |
| 2 | Opus 4.7 (thinking) | 1552.00 | +/-8 | 7,038 | Anthropic | Proprietary |
| 3 | Opus 4.7 | 1549.00 | +/-8 | 7,375 | Anthropic | Proprietary |
| 4 | claude-opus-4-8-thinking | 1549.00 | +/-15 | 1,565 | Anthropic | Proprietary |
| 5 | Claude Opus 4.6 | 1548.00 | +/-7 | 11,224 | Anthropic | Proprietary |
| 6 | claude-opus-4-8 | 1543.00 | +/-15 | 1,686 | Anthropic | Proprietary |
| 7 | Claude Opus 4 (thinking-32k) | 1530.00 | +/-7 | 7,621 | Anthropic | Proprietary |
| 8 | GLM 5.1 | 1529.00 | +/-10 | 4,196 | Zhipu AI | MIT |
| 9 | Muse Spark | 1528.00 | +/-10 | 3,566 | Facebook AI Research | Proprietary |
| 10 | Gemini 3.1 Pro Preview | 1527.00 | +/-6 | 13,244 | Google DeepMind | Proprietary |
| 11 | Qwen3.7-Max-Preview | 1525.00 | +/-18 | 1,136 | Alibaba | Proprietary |
| 12 | Claude Sonnet 4.6 | 1524.00 | +/-7 | 8,620 | Anthropic | Proprietary |
| 13 | Claude Opus 4 | 1523.00 | +/-6 | 17,039 | Anthropic | Proprietary |
| 14 | GPT-5.5 (high) | 1522.00 | +/-9 | 5,734 | OpenAI | Proprietary |
| 15 | GPT-5.4 (high) | 1520.00 | +/-7 | 8,625 | OpenAI | Proprietary |
| 16 | Claude Sonnet 4.5 (thinking-32k) | 1520.00 | +/-5 | 19,003 | Anthropic | Proprietary |
| 17 | Gemini 3.0 Pro (Preview 11-2025) | 1519.00 | +/-7 | 8,573 | Google DeepMind | Proprietary |
| 18 | GPT-5.2 | 1516.00 | +/-7 | 8,804 | OpenAI | Proprietary |
| 19 |  | 1516.00 | +/-7 | 9,074 | xAI | Proprietary |
| 20 | ERNIE-5.1-Preview | 1516.00 | +/-9 | 5,094 | Baidu | Proprietary |
| 21 | MiMo V2.5 Pro | 1516.00 | +/-9 | 5,426 | Xiaomi | MIT |
| 22 | Claude Sonnet 4.5 | 1514.00 | +/-5 | 18,877 | Anthropic | Proprietary |
| 23 | Opus 4.1 (thinking-16k) | 1513.00 | +/-6 | 9,843 | Anthropic | Proprietary |
| 24 | Qwen3.5 Max Preview | 1513.00 | +/-8 | 5,827 | Alibaba | Proprietary |
| 25 |  | 1512.00 | +/-7 | 9,236 | xAI | Proprietary |
| 26 | GPT-5.5 Instant | 1512.00 | +/-8 | 7,571 | OpenAI | Proprietary |
| 27 | GPT-5.4 | 1512.00 | +/-7 | 9,532 | OpenAI | Proprietary |
| 28 | DOLA Seed 2.0 Pro | 1511.00 | +/-7 | 11,535 | ByteDance Seed | Proprietary |
| 29 | Kimi K2.6 | 1511.00 | +/-9 | 5,269 | Moonshot AI | Modified MIT |
| 30 | minimax-m3 | 1509.00 | +/-17 | 1,320 | MiniMax | Proprietary |
| 31 | Gemini 3.0 Flash | 1509.00 | +/-8 | 6,384 | Google DeepMind | Proprietary |
| 32 |  | 1508.00 | +/-8 | 6,820 | xAI | Proprietary |
| 33 | Qwen3.6-Max-Preview | 1507.00 | +/-16 | 1,460 | Alibaba | Proprietary |
| 34 | Gemini 3.5 Flash | 1506.00 | +/-12 | 2,862 | Google DeepMind | Proprietary |
| 35 | Opus 4.1 | 1505.00 | +/-5 | 15,530 | Anthropic | Proprietary |
| 36 | Kimi K2.5 Instant | 1505.00 | +/-14 | 1,800 | Moonshot AI | Modified MIT |
| 37 | Kimi K2 Thinking | 1504.00 | +/-6 | 10,784 | Moonshot AI | Modified MIT |
| 38 | MiMo V2 Pro | 1504.00 | +/-8 | 6,640 | Xiaomi | Proprietary |
| 39 | DeepSeek-V4-Pro | 1504.00 | +/-8 | 6,330 | DeepSeek-AI | MIT |
| 40 | GPT-5.5 | 1502.00 | +/-8 | 6,021 | OpenAI | Proprietary |
| 41 | LongCat Flash Chat (2602) | 1502.00 | +/-8 | 7,584 | Meituan | Proprietary |
| 42 | GPT-5.4 mini (high) | 1499.00 | +/-7 | 8,409 | OpenAI | Proprietary |
| 43 | Claude Opus 4 (thinking-16k) | 1498.00 | +/-8 | 6,676 | Anthropic | Proprietary |
| 44 |  | 1498.00 | +/-6 | 14,783 | xAI | Proprietary |
| 45 | Gemma 4 31B | 1497.00 | +/-15 | 1,363 | DeepMind | Apache 2.0 |
| 46 | GPT-5.3 | 1497.00 | +/-7 | 8,449 | OpenAI | Proprietary |
| 47 | GLM-5 | 1496.00 | +/-8 | 5,667 | Zhipu AI | MIT |
| 48 | DeepSeek-V4-Pro (thinking) | 1495.00 | +/-9 | 5,801 | DeepSeek-AI | MIT |
| 49 | Qwen3.5-397B-A17B | 1494.00 | +/-7 | 10,036 | Alibaba | Apache 2.0 |
| 50 | Qwen 3.6 Plus Preview | 1493.00 | +/-8 | 6,769 | Alibaba | Proprietary |
| 51 | Gemini 3.0 Flash (minimal) | 1492.00 | +/-6 | 14,459 | Google DeepMind | Proprietary |
| 52 | ERNIE 5.0 | 1491.00 | +/-7 | 8,416 | Baidu | Proprietary |
| 53 |  | 1491.00 | +/-6 | 15,318 | xAI | Proprietary |
| 54 | GPT-5.1 Pro (high) | 1490.00 | +/-7 | 8,210 | OpenAI | Proprietary |
| 55 | GPT-5.2 Pro (high) | 1490.00 | +/-6 | 11,437 | OpenAI | Proprietary |
| 56 |  | 1490.00 | +/-9 | 5,799 | xAI | Proprietary |
| 57 | MiMo V2.5 | 1489.00 | +/-9 | 5,874 | Xiaomi | MIT |
| 58 | amazon-nova-experimental-chat-26-02-10 | 1488.00 | +/-20 | 841 | Amazon | Proprietary |
| 59 | Kimi K2 Thinking (thinking-turbo) | 1487.00 | +/-6 | 14,542 | Moonshot AI | Modified MIT |
| 60 | GLM-4.7 | 1486.00 | +/-12 | 2,411 | Zhipu AI | MIT |
| 61 | GPT-5.2 | 1483.00 | +/-6 | 12,897 | OpenAI | Proprietary |
| 62 | MiMo V2 Omni | 1483.00 | +/-14 | 1,815 | Xiaomi | Proprietary |
| 63 | Qwen3 Max (Preview) | 1482.00 | +/-8 | 5,366 | Alibaba | Proprietary |
| 64 | Gemma 4 26B A4B | 1480.00 | +/-15 | 1,367 | DeepMind | Apache 2.0 |
| 65 | DeepSeek-V4-Flash | 1480.00 | +/-8 | 6,163 | DeepSeek-AI | MIT |
| 66 | amazon-nova-experimental-chat-26-01-10 | 1480.00 | +/-21 | 735 | Amazon | Proprietary |
| 67 | Haiku 4.5 | 1479.00 | +/-5 | 19,804 | Anthropic | Proprietary |
| 68 |  | 1476.00 | +/-7 | 7,987 | MiniMaxAI | Modified MIT |
| 69 | DeepSeek V3.2 (thinking) | 1476.00 | +/-7 | 8,375 | DeepSeek-AI | MIT |
| 70 | DeepSeek-V4-Flash (thinking) | 1476.00 | +/-8 | 6,073 | DeepSeek-AI | MIT |
| 71 | qwen3-max-2025-09-23 | 1475.00 | +/-13 | 2,041 | Alibaba | Proprietary |
| 72 | DeepSeek V3.2-Exp (thinking) | 1475.00 | +/-13 | 1,920 | DeepSeek-AI | MIT |
| 73 | LongCat Flash Chat (2602) | 1474.00 | +/-13 | 2,233 | Meituan | MIT |
| 74 | GPT-5.1 Instant | 1474.00 | +/-7 | 9,126 | OpenAI | Proprietary |
| 75 | Qwen3-235B-A22B-2507 | 1473.00 | +/-5 | 21,022 | Alibaba | Apache 2.0 |
| 76 | Claude Sonnet 4 (thinking-32k) | 1473.00 | +/-8 | 6,414 | Anthropic | Proprietary |
| 77 | ERNIE 5.0 | 1472.00 | +/-13 | 1,955 | Baidu | Proprietary |
| 78 | mistral-medium-3.5 | 1472.00 | +/-14 | 1,810 | Mistral | Modified MIT |
| 79 | DeepSeek V3.2 | 1469.00 | +/-6 | 10,431 | DeepSeek-AI | MIT |
| 80 | GPT-4o(2025-03-27) | 1469.00 | +/-5 | 15,868 | OpenAI | Proprietary |
| 81 | Kimi K2 0905 | 1467.00 | +/-13 | 2,243 | Moonshot AI | Modified MIT |
| 82 | GPT-5-Pro (high) | 1467.00 | +/-8 | 6,358 | OpenAI | Proprietary |
| 83 | Mistral Large 3 | 1467.00 | +/-6 | 9,770 | MistralAI | Apache 2.0 |
| 84 | DeepSeek V3.2-Exp | 1466.00 | +/-12 | 2,501 | DeepSeek-AI | MIT |
| 85 | Qwen3-VL-235B-A22B-Instruct | 1466.00 | +/-13 | 2,314 | Alibaba | Apache 2.0 |
| 86 | Gemini 2.5 Pro Experimental 03-25 | 1465.00 | +/-4 | 26,215 | Google DeepMind | Proprietary |
| 87 | DeepSeek-R1-0528 | 1465.00 | +/-11 | 2,729 | DeepSeek-AI | MIT |
| 88 | Claude Opus 4 | 1464.00 | +/-7 | 7,900 | Anthropic | Proprietary |
| 89 | GPT-5 | 1463.00 | +/-8 | 5,989 | OpenAI | Proprietary |
| 90 | DeepSeek-V3.1 Terminus (thinking) | 1463.00 | +/-24 | 636 | DeepSeek-AI | MIT |
| 91 |  | 1462.00 | +/-6 | 13,198 | xAI | Proprietary |
| 92 | hunyuan-hy3-preview | 1461.00 | +/-14 | 1,834 | Tencent | tencent-hunyuan-community |
| 93 | Kimi K2 | 1460.00 | +/-8 | 5,243 | Moonshot AI | Modified MIT |
| 94 | GLM-4.6 | 1460.00 | +/-7 | 7,481 | Zhipu AI | MIT |
| 95 | GPT-4.5 | 1459.00 | +/-13 | 1,939 | OpenAI | Proprietary |
| 96 |  | 1459.00 | +/-16 | 1,249 | xAI | Proprietary |
| 97 | OpenAI o3 | 1459.00 | +/-6 | 11,749 | OpenAI | Proprietary |
| 98 | GPT-5.4 nano (high) | 1458.00 | +/-7 | 8,396 | OpenAI | Proprietary |
| 99 | Qwen3.5-122B-A10B | 1458.00 | +/-7 | 7,490 | Alibaba | Apache 2.0 |
| 100 | gemini-3.1-flash-lite-preview | 1458.00 | +/-7 | 10,841 | Google | Proprietary |
| 101 | Qwen3-Coder-480B-A35B | 1457.00 | +/-9 | 4,852 | Alibaba | Apache 2.0 |
| 102 | DeepSeek-V3.1 (thinking) | 1457.00 | +/-13 | 1,905 | DeepSeek-AI | MIT |
| 103 | Magistral-Medium-2506 | 1456.00 | +/-5 | 20,831 | MistralAI | Proprietary |
| 104 | GPT-4.1 | 1456.00 | +/-7 | 9,316 | OpenAI | Proprietary |
| 105 | Qwen3-VL-235B-A22B-Instruct (thinking) | 1455.00 | +/-14 | 1,626 | Alibaba | Apache 2.0 |
| 106 | GLM-4.5 | 1455.00 | +/-9 | 4,773 | Zhipu AI | MIT |
| 107 | Claude Sonnet 3.7 (thinking-32k) | 1451.00 | +/-8 | 6,191 | Anthropic | Proprietary |
| 108 | nvidia-nemotron-3-ultra-550b-a55b-nvfp4 | 1450.00 | +/-22 | 788 | Nvidia | OpenMDW-1.1 |
| 109 | Step 3.5 Flash | 1450.00 | +/-7 | 9,448 | StepFunAI | Apache 2.0 |
| 110 | Qwen3.5-27B | 1449.00 | +/-8 | 7,228 | Alibaba | Apache 2.0 |
| 111 | Claude Sonnet 4 | 1449.00 | +/-7 | 7,397 | Anthropic | Proprietary |
| 112 | DeepSeek-V3.1 | 1447.00 | +/-12 | 2,625 | DeepSeek-AI | MIT |
| 113 | qwen3-235b-a22b-no-thinking | 1446.00 | +/-8 | 6,977 | Alibaba | Apache 2.0 |
| 114 | Qwen3-Next | 1446.00 | +/-9 | 4,791 | Alibaba | Apache 2.0 |
| 115 | mimo-v2-flash (non-thinking) | 1445.00 | +/-6 | 11,686 | Xiaomi | MIT |
| 116 | DeepSeek-R1 | 1445.00 | +/-12 | 2,317 | DeepSeek-AI | MIT |
| 117 |  | 1444.00 | +/-7 | 10,571 | MiniMaxAI | Modified MIT |
| 118 |  | 1443.00 | +/-8 | 5,402 | xAI | Proprietary |
| 119 | qwen3-235b-a22b-thinking-2507 | 1442.00 | +/-15 | 1,612 | Alibaba | Apache 2.0 |
| 120 | trinity-large-preview | 1441.00 | +/-8 | 7,347 | Arcee AI | Apache 2.0 |
| 121 | Qwen3-30B-A3B-2507 | 1440.00 | +/-9 | 4,663 | Alibaba | Apache 2.0 |
| 122 |  | 1439.00 | +/-10 | 3,426 | MiniMaxAI | MIT |
| 123 | DeepSeek-V3.1 Terminus | 1439.00 | +/-21 | 778 | DeepSeek-AI | MIT |
| 124 | hunyuan-vision-1.5-thinking | 1438.00 | +/-27 | 437 | Tencent | Proprietary |
| 125 |  | 1437.00 | +/-9 | 3,956 | xAI | Proprietary |
| 126 | Qwen3.5-35B-A3B | 1436.00 | +/-7 | 7,669 | Alibaba | Apache 2.0 |
| 127 |  | 1436.00 | +/-7 | 8,157 | xAI | Proprietary |
| 128 | amazon-nova-experimental-chat-12-10 | 1435.00 | +/-21 | 704 | Amazon | Proprietary |
| 129 | OpenAI o3-mini (high) | 1435.00 | +/-12 | 2,596 | OpenAI | Proprietary |
| 130 | Claude 3.5 Sonnet | 1434.00 | +/-6 | 14,964 | Anthropic | Proprietary |
| 131 | Qwen3-235B-A22B | 1434.00 | +/-9 | 4,341 | Alibaba | Apache 2.0 |
| 132 | ERNIE 5.0 | 1433.00 | +/-19 | 916 | Baidu | Proprietary |
| 133 | mistral-medium-2505 | 1433.00 | +/-8 | 5,900 | Mistral | Proprietary |
| 134 | GPT-4.1 mini | 1433.00 | +/-7 | 6,919 | OpenAI | Proprietary |
| 135 | OpenAI o1 | 1433.00 | +/-10 | 3,973 | OpenAI | Proprietary |
| 136 | OpenAI o4 - mini | 1432.00 | +/-7 | 8,720 | OpenAI | Proprietary |
| 137 | mimo-v2-flash (thinking) | 1432.00 | +/-12 | 2,444 | Xiaomi | MIT |
| 138 | Step 3.5 Flash | 1431.00 | +/-7 | 9,520 | StepFunAI | Proprietary |
| 139 | GPT-5-mini (high) | 1430.00 | +/-8 | 5,500 | OpenAI | Proprietary |
| 140 | Claude Sonnet 3.7 | 1429.00 | +/-7 | 7,145 | Anthropic | Proprietary |
| 141 | DeepSeek-V3-0324 | 1429.00 | +/-7 | 8,367 | DeepSeek-AI | MIT |
| 142 | Gemini 2.5 Flash-Preview-09-2025 | 1428.00 | +/-8 | 6,842 | Google DeepMind | Proprietary |
| 143 | GLM-4.5-Air | 1427.00 | +/-8 | 6,106 | Zhipu AI | MIT |
| 144 | Gemini 2.5 Flash | 1424.00 | +/-4 | 25,609 | Google DeepMind | Proprietary |
| 145 | GLM-4.7-Flash | 1423.00 | +/-11 | 2,690 | Zhipu AI | MIT |
| 146 | Qwen3-Next (thinking) | 1421.00 | +/-11 | 2,676 | Alibaba | Apache 2.0 |
| 147 | amazon-nova-experimental-chat-11-10 | 1420.00 | +/-8 | 5,318 | Amazon | Proprietary |
| 148 | GLM-4.6V | 1418.00 | +/-25 | 534 | Zhipu AI | MIT |
| 149 | OpenAI o1 | 1417.00 | +/-9 | 5,123 | OpenAI | Proprietary |
| 150 | minimax-m1 | 1416.00 | +/-8 | 6,486 | MiniMax | Apache 2.0 |
| 151 | OpenAI o3-mini | 1416.00 | +/-6 | 9,461 | OpenAI | Proprietary |
| 152 | trinity-large-thinking | 1414.00 | +/-8 | 7,789 | Arcee AI | Apache 2.0 |
| 153 | Mistral-Small-3.2 | 1413.00 | +/-10 | 3,359 | MistralAI | Apache 2.0 |
| 154 | ling-flash-2.0 | 1413.00 | +/-15 | 1,528 | Ant Group | MIT |
| 155 | amazon-nova-experimental-chat-10-20 | 1411.00 | +/-12 | 2,292 | Amazon | Proprietary |
| 156 | nvidia-nemotron-3-super-120b-a12b | 1410.00 | +/-14 | 1,766 | Nvidia | NVIDIA Open Model |
| 157 | intellect-3 | 1409.00 | +/-19 | 972 | Prime Intellect | MIT |
| 158 | Step3 | 1408.00 | +/-17 | 1,232 | StepFunAI | Apache 2.0 |
| 159 | Qwen3-32B | 1408.00 | +/-24 | 513 | Alibaba | Apache 2.0 |
| 160 | nvidia-llama-3.3-nemotron-super-49b-v1.5 | 1405.00 | +/-22 | 659 | Nvidia | Nvidia Open |
| 161 | GLM-4.5V | 1405.00 | +/-18 | 991 | Zhipu AI | MIT |
| 162 | Qwen2.5-Max | 1403.00 | +/-8 | 5,102 | Alibaba | Proprietary |
| 163 | hunyuan-turbos-20250226 | 1400.00 | +/-31 | 275 | Tencent | Proprietary |
| 164 | Hunyuan-T1 | 1400.00 | +/-20 | 805 | Tencent AI Lab | Proprietary |
| 165 | Claude 3.5 Sonnet | 1398.00 | +/-7 | 13,607 | Anthropic | Proprietary |
| 166 | Gemini 2.5 Flash-Lite-Preview-09-2025 (no-thinking) | 1398.00 | +/-7 | 9,679 | Google DeepMind | Proprietary |
| 167 | Nova 2 Lite | 1397.00 | +/-12 | 2,516 | Amazon | Proprietary |
| 168 | mercury-2 | 1397.00 | +/-21 | 768 | Inception AI | Proprietary |
| 169 | hunyuan-turbos-20250416 | 1394.00 | +/-14 | 1,776 | Tencent | Proprietary |
| 170 | llama-3.1-nemotron-ultra-253b-v1 | 1391.00 | +/-30 | 367 | Nvidia | Nvidia Open Model |
| 171 | ring-flash-2.0 | 1391.00 | +/-15 | 1,540 | Ant Group | MIT |
| 172 | GPT OSS 120B | 1391.00 | +/-8 | 6,494 | OpenAI | Apache 2.0 |
| 173 | OpenAI o3-mini (high) | 1390.00 | +/-10 | 3,296 | OpenAI | Proprietary |
| 174 | C4AI Command A (202503) | 1390.00 | +/-6 | 10,221 | CohereAI | CC-BY-NC-4.0 |
| 175 | amazon-nova-experimental-chat-10-09 | 1389.00 | +/-24 | 552 | Amazon | Proprietary |
| 176 | OpenAI o1-mini | 1388.00 | +/-7 | 8,478 | OpenAI | Proprietary |
| 177 | DeepSeek-V3 | 1388.00 | +/-10 | 3,280 | DeepSeek-AI | DeepSeek |
| 178 | Qwen3-30B-A3B | 1387.00 | +/-9 | 4,528 | Alibaba | Apache 2.0 |
| 179 |  | 1387.00 | +/-9 | 4,255 | xAI | Proprietary |
| 180 | Magistral-Medium-2506 | 1386.00 | +/-12 | 2,248 | MistralAI | Proprietary |
| 181 | QwQ-32B | 1385.00 | +/-9 | 4,045 | Alibaba | Apache 2.0 |
| 182 | Claude 3.5 Haiku | 1385.00 | +/-6 | 11,249 | Anthropic | Proprietary |
| 183 |  | 1385.00 | +/-15 | 1,544 | MiniMaxAI | Apache 2.0 |
| 184 | olmo-3.1-32b-instruct | 1384.00 | +/-12 | 2,512 | Ai2 | Apache 2.0 |
| 185 | Gemini 2.5 Flash-Lite (thinking) | 1384.00 | +/-8 | 6,002 | Google DeepMind | Proprietary |
| 186 | GPT-5-Nano (high) | 1382.00 | +/-15 | 1,684 | OpenAI | Proprietary |
| 187 | qwen-plus-0125 | 1380.00 | +/-18 | 893 | Alibaba | Proprietary |
| 188 | llama-3.1-405b-instruct-bf16 | 1375.00 | +/-8 | 6,249 | Meta | Llama 3.1 Community |
| 189 | deepseek-v2.5-1210 | 1375.00 | +/-17 | 1,079 | DeepSeek | DeepSeek |
| 190 | GPT-4.1 nano | 1374.00 | +/-19 | 807 | OpenAI | Proprietary |
| 191 | Llama 4 Maverick Instruct | 1373.00 | +/-7 | 6,998 | Facebook AI Research | Llama 4 |
| 192 | hunyuan-turbo-0110 | 1372.00 | +/-30 | 299 | Tencent | Proprietary |
| 193 | step-2-16k-exp-202412 | 1372.00 | +/-20 | 737 | StepFun | Proprietary |
| 194 | GPT OSS 20B | 1370.00 | +/-13 | 2,167 | OpenAI | Apache 2.0 |
| 195 | athene-v2-chat | 1369.00 | +/-9 | 4,019 | NexusFlow | NexusFlow |
| 196 | yi-lightning | 1369.00 | +/-10 | 4,316 | 01 AI | Proprietary |
| 197 | GPT-4o | 1369.00 | +/-6 | 19,526 | OpenAI | Proprietary |
| 198 | DeepSeek V2.5 | 1368.00 | +/-9 | 4,252 | DeepSeek-AI | DeepSeek |
| 199 | llama-3.1-405b-instruct-fp8 | 1368.00 | +/-7 | 9,714 | Meta | Llama 3.1 Community |
| 200 | mercury | 1367.00 | +/-29 | 394 | Inception AI | Proprietary |
| 201 | hunyuan-large-2025-02-10 | 1367.00 | +/-25 | 519 | Tencent | Proprietary |
| 202 | Gemini 2.0 Flash Experimental | 1365.00 | +/-7 | 6,995 | DeepMind | Proprietary |
| 203 | olmo-3-32b-think | 1365.00 | +/-18 | 1,054 | Ai2 | Apache 2.0 |
| 204 | llama-3.3-nemotron-49b-super-v1 | 1363.00 | +/-31 | 286 | Nvidia | Nvidia |
| 205 | nvidia-nemotron-3-nano-30b-a3b-bf16 | 1363.00 | +/-10 | 3,277 | Nvidia | NVIDIA Open Model |
| 206 | Llama 4 Scout Instruct | 1362.00 | +/-9 | 5,256 | Facebook AI Research | Llama |
| 207 | Mistral-Small-3.1-24B-Instruct-2503 | 1362.00 | +/-8 | 6,137 | MistralAI | Apache 2.0 |
| 208 | GPT-4o | 1360.00 | +/-8 | 7,318 | OpenAI | Proprietary |
| 209 |  | 1359.00 | +/-7 | 10,368 | xAI | Proprietary |
| 210 | Gemma 3 - 27B (IT) | 1358.00 | +/-7 | 8,076 | Google DeepMind | Gemma |
| 211 | qwen2.5-plus-1127 | 1357.00 | +/-14 | 1,553 | Alibaba | Proprietary |
| 212 | Gemini 1.5 Pro | 1356.00 | +/-7 | 9,175 | Google DeepMind | Proprietary |
| 213 | granite-4.1-8b | 1356.00 | +/-20 | 1,025 | IBM | Apache 2.0 |
| 214 | hunyuan-large-vision | 1356.00 | +/-19 | 963 | Tencent | Proprietary |
| 215 | Qwen2.5-VL-72B-Instruct | 1356.00 | +/-8 | 6,688 | Alibaba | Qwen |
| 216 | step-1o-turbo-202506 | 1354.00 | +/-15 | 1,505 | StepFun | Proprietary |
| 217 | Claude3-Opus | 1353.00 | +/-6 | 33,748 | Anthropic | Proprietary |
| 218 | mistral-large-2407 | 1353.00 | +/-8 | 7,589 | Mistral | Mistral Research |
| 219 | qwen-max-0919 | 1353.00 | +/-11 | 2,756 | Alibaba | Qwen |
| 220 | glm-4-plus | 1352.00 | +/-9 | 4,449 | Zhipu AI | Proprietary |
| 221 | athene-70b-0725 | 1350.00 | +/-11 | 3,122 | NexusFlow | CC-BY-NC-4.0 |
| 222 | GPT-4o mini | 1349.00 | +/-7 | 10,927 | OpenAI | Proprietary |
| 223 | Runway Gen-4 Turbo | 1347.00 | +/-7 | 17,104 | Runway | Proprietary |
| 224 | Gemini 1.5 Pro | 1347.00 | +/-8 | 12,747 | Google DeepMind | Proprietary |
| 225 | mistral-large-2411 | 1346.00 | +/-9 | 4,212 | Mistral | MRL |
| 226 | Llama3.3-70B-Instruct | 1345.00 | +/-7 | 8,747 | Facebook AI Research | Llama-3.3 |
| 227 | Gemini 2.0 Flash-Lite | 1343.00 | +/-10 | 3,474 | DeepMind | Proprietary |
| 228 | amazon-nova-pro-v1.0 | 1343.00 | +/-9 | 3,853 | Amazon | Proprietary |
| 229 | Qwen2.5-Coder-32B-Instruct | 1342.00 | +/-19 | 873 | Alibaba | Apache 2.0 |
| 230 | deepseek-coder-v2 | 1342.00 | +/-12 | 2,671 | DeepSeek | DeepSeek License |
| 231 | GPT-4 | 1339.00 | +/-7 | 15,605 | OpenAI | Proprietary |
| 232 | olmo-3.1-32b-think | 1339.00 | +/-15 | 1,566 | Ai2 | Apache 2.0 |
| 233 | gemini-advanced-0514 | 1338.00 | +/-9 | 8,138 | Google | Proprietary |
| 234 |  | 1335.00 | +/-7 | 8,652 | xAI | Proprietary |
| 235 | Llama3.1-70B-Instruct | 1333.00 | +/-7 | 9,389 | Facebook AI Research | Llama 3.1 Community |
| 236 | hunyuan-standard-2025-02-10 | 1332.00 | +/-24 | 549 | Tencent | Proprietary |
| 237 | GPT-4 | 1331.00 | +/-8 | 15,289 | OpenAI | Proprietary |
| 238 | glm-4-plus-0111 | 1331.00 | +/-18 | 894 | Zhipu | Proprietary |
| 239 | Llama3.1-70B-Instruct | 1329.00 | +/-15 | 1,312 | Facebook AI Research | Llama 3.1 |
| 240 | ibm-granite-h-small | 1328.00 | +/-17 | 1,268 | IBM | Apache 2.0 |
| 241 | GPT-4 | 1328.00 | +/-9 | 8,306 | OpenAI | Proprietary |
| 242 | Gemma 3 - 12B (IT) | 1317.00 | +/-23 | 543 | Google DeepMind | Gemma |
| 243 | Claude3-Sonnet | 1317.00 | +/-7 | 18,888 | Anthropic | Proprietary |
| 244 | gemini-1.5-flash-002 | 1316.00 | +/-8 | 5,892 | Google | Proprietary |
| 245 | reka-core-20240904 | 1315.00 | +/-15 | 1,216 | Reka AI | Proprietary |
| 246 | GPT-4 | 1313.00 | +/-8 | 13,719 | OpenAI | Proprietary |
| 247 | Mistral Small 24B Instruct 2501 | 1312.00 | +/-12 | 2,083 | MistralAI | Apache 2.0 |
| 248 | jamba-1.5-large | 1312.00 | +/-15 | 1,440 | AI21 Labs | Jamba Open |
| 249 | llama-3.1-nemotron-51b-instruct | 1311.00 | +/-22 | 665 | Nvidia | Llama 3.1 |
| 250 | gemini-1.5-flash-001 | 1310.00 | +/-8 | 10,680 | Google | Proprietary |
| 251 | Gemma-3n-E4B | 1309.00 | +/-10 | 3,532 | Google DeepMind | Gemma |
| 252 | GLM4 | 1308.00 | +/-14 | 1,718 | Zhipu AI | Proprietary |
| 253 | llama-3.1-tulu-3-70b | 1307.00 | +/-24 | 450 | Ai2 | Llama 3.1 |
| 254 | nemotron-4-340b-instruct | 1307.00 | +/-11 | 3,254 | Nvidia | NVIDIA Open Model |
| 255 | Phi 4 - 14B | 1306.00 | +/-10 | 3,305 | Microsoft Azure | MIT |
| 256 | amazon-nova-lite-v1.0 | 1306.00 | +/-10 | 3,060 | Amazon | Proprietary |
| 257 | Llama3-70B-Instruct | 1305.00 | +/-7 | 28,126 | Facebook AI Research | Llama 3 Community |
| 258 | gemma-2-27b-it | 1305.00 | +/-6 | 12,088 | Google | Gemma license |
| 259 | hunyuan-standard-256k | 1301.00 | +/-25 | 497 | Tencent | Proprietary |
| 260 | Claude3-Haiku | 1300.00 | +/-7 | 20,898 | Anthropic | Proprietary |
| 261 | Qwen2-72B-Instruct | 1296.00 | +/-9 | 6,249 | Alibaba | Qianwen LICENSE |
| 262 | mistral-large-2402 | 1294.00 | +/-9 | 10,418 | Mistral | Proprietary |
| 263 | C4AI Aya Vision 32B | 1292.00 | +/-9 | 4,685 | CohereAI | CC-BY-NC-4.0 |
| 264 | reka-flash-20240904 | 1290.00 | +/-15 | 1,207 | Reka AI | Proprietary |
| 265 | amazon-nova-micro-v1.0 | 1288.00 | +/-10 | 2,981 | Amazon | Proprietary |
| 266 | Llama3.1-8B-Instruct | 1287.00 | +/-26 | 478 | Facebook AI Research | Apache 2.0 |
| 267 | command-r-08-2024 | 1280.00 | +/-13 | 1,783 | Cohere | CC-BY-NC-4.0 |
| 268 | olmo-2-0325-32b-instruct | 1279.00 | +/-27 | 427 | Ai2 | Apache-2.0 |
| 269 | command-r-plus-08-2024 | 1279.00 | +/-14 | 1,675 | Cohere | CC-BY-NC-4.0 |
| 270 | Qwen1.5-110B-Chat | 1279.00 | +/-10 | 4,763 | Alibaba | Qianwen LICENSE |
| 271 | reka-flash-21b-20240226-online | 1276.00 | +/-13 | 2,879 | Reka AI | Proprietary |
| 272 | Mixtral-8x22B-Instruct-v0.1 | 1276.00 | +/-9 | 8,780 | MistralAI | Apache 2.0 |
| 273 | Gemma 3 - 4B (IT) | 1275.00 | +/-24 | 605 | Google DeepMind | Gemma |
| 274 | ministral-8b-2410 | 1274.00 | +/-19 | 838 | Mistral | MRL |
| 275 | Qwen1.5-72B-Chat | 1274.00 | +/-10 | 6,370 | Alibaba | Qianwen LICENSE |
| 276 | gpt-3.5-turbo-0125 | 1273.00 | +/-8 | 11,130 | OpenAI | Proprietary |
| 277 | gemini-1.5-flash-8b-001 | 1272.00 | +/-8 | 6,069 | Google | Proprietary |
| 278 | gemma-2-9b-it-simpo | 1272.00 | +/-15 | 1,471 | Princeton | MIT |
| 279 | C4AI Command R+ | 1271.00 | +/-8 | 13,937 | CohereAI | CC-BY-NC-4.0 |
| 280 | gemma-2-9b-it | 1271.00 | +/-7 | 8,921 | Google | Gemma license |
| 281 | reka-flash-21b-20240226 | 1266.00 | +/-11 | 4,748 | Reka AI | Proprietary |
| 282 | jamba-1.5-mini | 1265.00 | +/-15 | 1,352 | AI21 Labs | Jamba Open |
| 283 | mistral-medium | 1261.00 | +/-10 | 5,149 | Mistral | Proprietary |
| 284 | gpt-3.5-turbo-1106 | 1261.00 | +/-16 | 2,121 | OpenAI | Proprietary |
| 285 | qwen1.5-32b-chat | 1261.00 | +/-11 | 3,930 | Alibaba | Qianwen LICENSE |
| 286 | Llama3.1-8B-Instruct | 1260.00 | +/-7 | 8,582 | Facebook AI Research | Llama 3.1 Community |
| 287 | C4AI Aya Vision 8B | 1255.00 | +/-15 | 1,567 | CohereAI | CC-BY-NC-4.0 |
| 288 | llama-3.1-tulu-3-8b | 1253.00 | +/-25 | 476 | Ai2 | Llama 3.1 |
| 289 | Llama3-8B-Instruct | 1252.00 | +/-8 | 18,374 | Facebook AI Research | Llama 3 Community |
| 290 | DBRX Instruct | 1250.00 | +/-11 | 5,502 | databricks | DBRX LICENSE |
| 291 | granite-3.1-2b-instruct | 1248.00 | +/-25 | 508 | IBM | Apache 2.0 |
| 292 | Gemini-pro | 1248.00 | +/-24 | 678 | DeepMind | Proprietary |
| 293 | InternLM2-Base-20B | 1247.00 | +/-14 | 1,684 | Shanghai AI Laboratory | Other |
| 294 | Yi-1.5-34B | 1247.00 | +/-10 | 3,841 | 01.AI | Apache-2.0 |
| 295 | zephyr-orpo-141b-A35b-v0.1 | 1244.00 | +/-21 | 831 | HuggingFace | Apache 2.0 |
| 296 | command-r | 1242.00 | +/-9 | 9,645 | Cohere | CC-BY-NC-4.0 |
| 297 | granite-3.0-8b-instruct | 1239.00 | +/-18 | 1,108 | IBM | Apache 2.0 |
| 298 | gemini-pro-dev-api | 1238.00 | +/-14 | 2,681 | Google | Proprietary |
| 299 | Qwen1.5-14B-Chat | 1238.00 | +/-13 | 3,208 | Alibaba | Qianwen LICENSE |
| 300 | mixtral-8x7b-instruct-v0.1 | 1238.00 | +/-8 | 11,784 | Mistral | Apache 2.0 |
| 301 | starling-lm-7b-beta | 1234.00 | +/-13 | 2,948 | Nexusflow | Apache-2.0 |
| 302 | Phi-3-medium 14B-preview | 1230.00 | +/-10 | 3,973 | Microsoft Azure | MIT |
| 303 | openchat-3.5-0106 | 1228.00 | +/-14 | 2,005 | OpenChat | Apache-2.0 |
| 304 | snowflake-arctic-instruct | 1223.00 | +/-11 | 5,734 | Snowflake | Apache 2.0 |
| 305 | DeepSeek LLM 67B Chat | 1216.00 | +/-24 | 649 | DeepSeek-AI | DeepSeek License |
| 306 | Gemma 1.1-7B-IT | 1216.00 | +/-10 | 4,332 | Google Research | Gemma license |
| 307 | tulu-2-dpo-70b | 1213.00 | +/-21 | 805 | AllenAI/UW | AI2 ImpACT Low-risk |
| 308 | Qwen1.5-7B-Chat | 1208.00 | +/-21 | 772 | Alibaba | Qianwen LICENSE |
| 309 | Qwen3-VL-2B | 1208.00 | +/-18 | 1,134 | Alibaba | Apache 2.0 |
| 310 | starling-lm-7b-alpha | 1206.00 | +/-16 | 1,397 | UC Berkeley | CC-BY-NC-4.0 |
| 311 | Yi-34B | 1204.00 | +/-13 | 2,345 | 01.AI | Yi License |
| 312 | Phi-3-small 7B | 1203.00 | +/-12 | 3,219 | Microsoft Azure | MIT |
| 313 | openchat-3.5 | 1201.00 | +/-20 | 971 | OpenChat | Apache-2.0 |
| 314 | Qwen-14B-Chat | 1196.00 | +/-24 | 599 | Alibaba | Qianwen LICENSE |
| 315 | Phi-3-mini 3.8B | 1196.00 | +/-14 | 1,841 | Microsoft Azure | MIT |
| 316 | gemma-2-2b-it | 1193.00 | +/-8 | 7,298 | Google | Gemma license |
| 317 | Vicuna 33B | 1192.00 | +/-13 | 2,866 | LM-SYS | Non-commercial |
| 318 | WizardLM-70B-V1.0 | 1192.00 | +/-20 | 988 | WizardLM Team | Llama 2 Community |
| 319 | Phi-3-mini 3.8B | 1186.00 | +/-12 | 3,449 | Microsoft Azure | MIT |
| 320 | openhermes-2.5-mistral-7b | 1185.00 | +/-23 | 589 | NousResearch | Apache-2.0 |
| 321 | Mistral-7B-Instruct-v0.2 | 1184.00 | +/-12 | 3,114 | MistralAI | Apache-2.0 |
| 322 | solar-10.7b-instruct-v1.0 | 1182.00 | +/-27 | 482 | Upstage AI | CC-BY-NC-4.0 |
| 323 | llama-2-70b-chat | 1177.00 | +/-10 | 5,717 | Meta | Llama 2 Community |
| 324 | llama-3.2-3b-instruct | 1175.00 | +/-16 | 1,351 | Meta | Llama 3.2 |
| 325 | nous-hermes-2-mixtral-8x7b-dpo | 1174.00 | +/-24 | 575 | NousResearch | Apache-2.0 |
| 326 | QwQ-32B-Preview | 1173.00 | +/-24 | 566 | Alibaba | Apache 2.0 |
| 327 | Gemma 1.1-2B-IT | 1171.00 | +/-14 | 1,963 | Google Research | Gemma license |
| 328 | Gemma 7B - It | 1167.00 | +/-17 | 1,381 | Google Research | Gemma license |
| 329 | MPT-30B-Chat | 1166.00 | +/-35 | 258 | MosaicML | CC-BY-NC-SA-4.0 |
| 330 | zephyr-7b-alpha | 1165.00 | +/-40 | 201 | HuggingFace | MIT |
| 331 | Vicuna 13B | 1162.00 | +/-14 | 2,389 | LM-SYS | Llama 2 Community |
| 332 | Baichuan2-13B-Chat | 1161.00 | +/-13 | 2,626 | Baichuan AI | Llama 2 Community |
| 333 | smollm2-1.7b-instruct | 1159.00 | +/-33 | 352 | HuggingFace | Apache 2.0 |
| 334 | CodeLLaMA-34B | 1158.00 | +/-20 | 853 | Facebook AI Research | Llama 2 Community |
| 335 | Phi-3-mini 3.8B | 1153.00 | +/-13 | 3,886 | Microsoft Azure | MIT |
| 336 | PaLM 2 | 1152.00 | +/-21 | 917 | Google Research | Proprietary |
| 337 | zephyr-7b-beta | 1151.00 | +/-18 | 1,250 | HuggingFace | MIT |
| 338 | wizardlm-13b | 1150.00 | +/-22 | 735 | Microsoft | Llama 2 Community |
| 339 | llama-3.2-1b-instruct | 1148.00 | +/-16 | 1,346 | Meta | Llama 3.2 |
| 340 | llama2-70b-steerlm-chat | 1144.00 | +/-28 | 467 | Nvidia | Llama 2 Community |
| 341 | Mistral 7B Instruct | 1143.00 | +/-20 | 1,032 | MistralAI | Apache 2.0 |
| 342 | Gemma 2B - It | 1136.00 | +/-22 | 742 | Google Research | Gemma license |
| 343 | Vicuna 7B | 1130.00 | +/-23 | 726 | LM-SYS | Llama 2 Community |
| 344 | Qwen1.5-4B-Chat | 1130.00 | +/-17 | 1,283 | Alibaba | Qianwen LICENSE |
| 345 | stripedhyena-nous-7b | 1126.00 | +/-22 | 704 | Together AI | Apache 2.0 |
| 346 | guanaco-33b | 1112.00 | +/-36 | 263 | UW | Non-commercial |
| 347 | olmo-7b-instruct | 1106.00 | +/-22 | 772 | Ai2 | Apache-2.0 |
| 348 | Baichuan2-7B-Chat | 1101.00 | +/-14 | 1,956 | Baichuan AI | Llama 2 Community |
| 349 | ChatGLM3-6B | 1089.00 | +/-26 | 535 | Zhipu AI | Apache-2.0 |
| 350 | MPT-7B-Chat | 1064.00 | +/-31 | 397 | MosaicML | CC-BY-NC-SA-4.0 |
| 351 | Koala | 1064.00 | +/-24 | 747 | DAMO Academy | Non-commercial |
| 352 | RWKV-4-Raven-14B | 1058.00 | +/-27 | 505 | RWKV | Apache 2.0 |
| 353 | oasst-pythia-12b | 1049.00 | +/-25 | 714 | OpenAssistant | Apache 2.0 |
| 354 | ChatGLM-6B | 1034.00 | +/-27 | 551 | Zhipu AI | Non-commercial |
| 355 | ChatGLM2-6B | 1029.00 | +/-35 | 293 | Zhipu AI | Apache-2.0 |
| 356 | stablelm-tuned-alpha-7b | 1003.00 | +/-33 | 363 | Stability AI | CC-BY-NC-SA-4.0 |
| 357 | alpaca-13b | 998.00 | +/-27 | 626 | Stanford | Non-commercial |
| 358 | dolly-v2-12b | 961.00 | +/-34 | 396 | Databricks | MIT |
| 359 | fastchat-t5-3b | 906.00 | +/-30 | 428 | LMSYS | Apache 2.0 |
| 360 | LLaMA 13B | 881.00 | +/-39 | 304 | Facebook AI Research | Non-commercial |
The data is for reference only; official sources take precedence. Links next to model names lead to the DataLearner model detail pages.
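When reading the table, two quick calculations help: how large a score gap is in terms of expected win rate, and whether two models' 95% confidence intervals overlap. The sketch below assumes the conventional Elo logistic curve with a 400-point scale, which the table itself does not state, and treats interval overlap as a rough heuristic rather than a formal significance test. Both function names are hypothetical.

```python
def expected_win_probability(score_a, score_b, scale=400.0):
    """Expected chance that A is preferred over B (assumed 400-point Elo scale)."""
    return 1.0 / (1.0 + 10 ** ((score_b - score_a) / scale))


def intervals_overlap(score_a, half_width_a, score_b, half_width_b):
    """True when the two +/- intervals share at least one point."""
    return abs(score_a - score_b) <= half_width_a + half_width_b


# Values taken from the table: rank 1 (1553 +/-7), rank 2 (1552 +/-8),
# and rank 40 (1502 +/-8).
top, second, fortieth = (1553, 7), (1552, 8), (1502, 8)

print(intervals_overlap(*top, *second))    # ranks 1 and 2
print(intervals_overlap(*top, *fortieth))  # ranks 1 and 40
print(round(expected_win_probability(top[0], fortieth[0]), 3))
```

Under these assumptions the intervals for ranks 1 and 2 overlap, so their order should not be over-interpreted, while the 51-point gap between rank 1 and rank 40 corresponds to an expected preference rate of only about 57%.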
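Frequently asked questions (FAQ)
What is LMArena Coding Arena?
LMArena Coding Arena is LMArena's anonymous evaluation platform focused on coding ability. Users submit real programming tasks (such as debugging, code generation, and algorithm implementation), the system shows the outputs of different models side by side with the model names hidden, and users vote for the better answer. The votes are then aggregated with an Elo algorithm into a continuously updated leaderboard.
How does Coding Arena differ from static benchmarks such as SWE-bench and HumanEval?
Static benchmarks such as SWE-bench, HumanEval, and MBPP use fixed test sets and automated scoring. They are highly reproducible but easy to optimize against specifically ("leaderboard gaming"). Coding Arena draws on open-ended requests from real users, so the test content is not fixed and better reflects how models perform in real programming work. The two approaches complement each other.
How do Chinese large language models perform on coding?
Chinese models such as DeepSeek and Qwen perform strongly in Coding Arena and rank among the global leaders. DeepSeek is open-sourced under the MIT license, and the Qwen series supports Chinese-language programming scenarios, which makes both important reference points for developers choosing an open-source coding model.
How can AI assist with everyday programming work?
Common uses include code completion and generation, debugging, code review, unit test generation, and translation between programming languages.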















