Artificial Analysis Intelligence Index: AI Model Intelligence Leaderboard

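Artificial Analysis Intelligence Index v4.0 combines 10 evaluation benchmarks (including GDPval-AA, Terminal-Bench, GPQA Diamond, and SciCode) to evaluate and rank AI models across dimensions such as math, science, coding, and reasoning.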

Top model: GPT-5.5 (xhigh)
Top score: 60
Number of models: 212
Data version: May 10, 2026
Data source: Artificial Analysis


Overall ranking table

Rank | Model | Intelligence Index | Organization
1 | GPT-5.5 (xhigh) | 60 | OpenAI
2 | GPT-5.5 (high) | 59 | OpenAI
3 | Opus 4.7 (max) | 57 | Anthropic
4 | Gemini 3.1 Pro Preview | 57 | Google Deep Mind
5 | GPT-5.5 (medium) | 57 | OpenAI
6 | Kimi K2.6 | 54 | Moonshot AI
7 | MiMo-V2.5-Pro | 54 | Xiaomi
8 | GPT-5.3 Codex (xhigh) | 54 | OpenAI
9 | Grok 4.3 | 53 | xAI
10 | Muse Spark | 52 | Facebook AI Research Lab
11 | Opus 4.7 (high) | 52 | Anthropic
12 | Qwen3.6-Max-Preview | 52 | Alibaba
13 | Claude Sonnet 4.6 (max) | 52 | Anthropic
14 | DeepSeek-V4-Pro (max) | 52 | DeepSeek-AI
15 | GLM 5.1 | 51 | Zhipu AI
16 | GPT-5.5 (low) | 51 | OpenAI
17 | Qwen 3.6 Plus Preview | 50 | Alibaba
18 | DeepSeek-V4-Pro (high) | 50 | DeepSeek-AI
19 | GLM-5 | 50 | Zhipu AI
20 | MiniMax-M2.7 | 50 | MiniMaxAI
21 | MiMo-V2.5 | 49 | Xiaomi
22 | GPT-5.4 mini (xhigh) | 49 | OpenAI
23 | GPT-5.4 (low) | 48 | OpenAI
24 | GLM-5-Turbo | 47 | Zhipu AI
25 | DeepSeek-V4-Flash (max) | 47 | DeepSeek-AI
26 | Gemini 3.0 Flash | 46 | Google Deep Mind
27 | Qwen3.6-27B | 46 | Alibaba
28 | Qwen3.5-397B-A17B | 45 | Alibaba
29 | Nova 2 Omni(Preview) | 45 | Amazon
30 | DeepSeek-V4-Flash (high) | 45 | DeepSeek-AI
31 | Claude Sonnet 4.6 (non-reasoning) | 44 | Anthropic
32 | GPT-5.4 nano (xhigh) | 44 | OpenAI
33 | GLM 5.1 | 44 | Zhipu AI
34 | Qwen3.6-35B-A3B | 43 | Alibaba
35 | MiMo-V2-Omni | 43 | Xiaomi
36 | Kimi K2.6 | 43 | Moonshot AI
37 | GLM-5V-Turbo | 43 | Zhipu AI
38 | Claude Sonnet 4.6 (Non-reasoning, Low Effort) | 43 | Anthropic
39 | Hy3-preview | 42 | Tencent
40 | Qwen3.5-122B-A10B | 42 | Alibaba
41 | Gemini 2.0 Flash Experimental | 41 | DeepMind
42 | Gemini 3.1 Pro Preview (low) | 41 | Google Deep Mind
43 | GPT-5.5 (non-reasoning) | 41 | OpenAI
44 | GLM-5 | 41 | Zhipu AI
45 | Qwen3.5-397B-A17B | 40 | Alibaba
46 | DeepSeek-V4-Pro | 39 | DeepSeek-AI
47 | Mistral Medium 3.5 | 39 | Mistral
48 | Gemma 4 31B | 39 | DeepMind
49 | Qwen3.5-Omni-Plus | 39 | Alibaba
50 | Grok 4.1 Fast | 39 | xAI
51 | Step 3.5 Flash | 38 | StepFunAI
52 | OpenAI o3 | 38 | OpenAI
53 | GPT-5.4 nano | 38 | OpenAI
54 | GPT-5.4 mini (medium) | 38 | OpenAI
55 | Kimi K2.5 | 37 | Moonshot AI
56 | Qwen3.6-27B | 37 | Alibaba
57 | Haiku 4.5 | 37 | Anthropic
58 | DeepSeek-V4-Flash | 36 | DeepSeek-AI
59 | NVIDIA Nemotron 3 Super | 36 | NVIDIA
60 | Qwen3.5-122B-A10B | 36 | Alibaba
61 | Nova 2 Pro(Preview) (medium) | 36 | Amazon
62 | MiMo-V2.5-Pro | 36 | Xiaomi
63 | GPT-5.4 (non-reasoning) | 35 | OpenAI
64 | Gemini 3.0 Flash | 35 | Google Deep Mind
65 | Gemini 2.5-Pro | 35 | Google Deep Mind
66 | Nova 2 Lite (high) | 35 | Amazon
67 | Hy3-preview | 34 | Tencent
68 | Ling-2.6-1T | 34 | InclusionAI
69 | Doubao Seed Code | 34 | ByteDance Seed
70 | Gemini 3.1 Flash-Lite Preview | 34 | Google
71 | GPT OSS 120B (high) | 33 | OpenAI
72 | Mercury 2 | 33 | Inception
73 | Qwen3.5-9B-Instruct | 32 | Alibaba
74 | Gemma 4 31B | 32 | DeepMind
75 | K-EXAONE | 32 | LG AI Research
76 | Grok-3 mini - Reasoning (high) | 32 | xAI
77 | Nova 2 Pro(Preview) (low) | 32 | Amazon
78 | Trinity Large Thinking | 32 | Arcee AI
79 | Qwen3.6-35B-A3B | 32 | Alibaba
80 | Gemma 4 26B A4B | 31 | DeepMind
81 | Haiku 4.5 | 31 | Anthropic
82 | Grok 4.3 | 31 | xAI
83 | Qwen3.5-35B-A3B | 31 | Alibaba
84 | MiMo-V2-Flash | 30 | Xiaomi
85 | EXAONE 4.5 33B | 30 | LG AI Research
86 | Nova 2 Lite (medium) | 30 | Amazon
87 | ERNIE 5.0 | 29 | Baidu
88 | Grok 4.20 0309 v2 | 29 | xAI
89 | Grok Code Fast 1 | 29 | xAI
90 | Nemotron Cascade 2 30B A3B | 28 | NVIDIA
91 | Qwen3-Coder-Next | 28 | Alibaba
92 | Nova 2 Omni(Preview) (medium) | 28 | Amazon
93 | Mistral Small 4 | 28 | Mistral
94 | Qwen3.5-9B-Instruct | 27 | Alibaba
95 | Magistral Medium 1.2 | 27 | Mistral
96 | Gemma 4 26B A4B | 27 | DeepMind
97 | Qwen3.5 4B | 27 | Alibaba
98 | DeepSeek-R1-0528 | 27 | DeepSeek-AI
99 | Qwen3-Next | 27 | Alibaba
100 | Ling 2.6 Flash | 26 | InclusionAI
101 | Qwen3.5-Omni-Flash | 26 | Alibaba
102 | Solar Pro 3 | 26 | Upstage
103 | JT-MINI | 25 | China Mobile
104 | Nova 2 Lite (low) | 25 | Amazon
105 | GPT OSS 20B (high) | 24 | OpenAI
106 | GPT OSS 120B (low) | 24 | OpenAI
107 | GPT-5.4 nano | 24 | OpenAI
108 | NVIDIA Nemotron 3 Nano | 24 | NVIDIA
109 | LongCat Flash Lite | 24 | LongCat
110 | Grok 4.1 Fast | 24 | xAI
111 | K-EXAONE | 23 | LG AI Research
112 | GPT-5.4 mini | 23 | OpenAI
113 | Nova 2 Omni(Preview) (low) | 23 | Amazon
114 | Nova 2 Pro(Preview) | 23 | Amazon
115 | Mi:dm K 2.5 Pro | 23 | Korea Telecom
116 | Mistral Large 3 | 23 | MistralAI
117 | Ring-1T | 23 | InclusionAI
118 | Qwen3.5 4B | 23 | Alibaba
119 | INTELLECT-3 | 22 | Prime Intellect
120 | Devstral 2 | 22 | Mistral
121 | Solar Open 100B | 22 | Upstage
122 | Gemini 2.5 Flash-Lite-Preview-09-2025 | 22 | Google Deep Mind
123 | Nemotron 3 Nano Omni 30B A3B Reasoning | 21 | NVIDIA
124 | GPT OSS 20B (low) | 21 | OpenAI
125 | Qwen3-Next | 20 | Alibaba
126 | Devstral Small 2 | 19 | Mistral
127 | Gemini 2.5 Flash-Lite-Preview-09-2025 | 19 | Google Deep Mind
128 | Motif-2-12.7B | 19 | Motif Technologies
129 | Ling-1T | 19 | InclusionAI
130 | Nova Premier | 19 | Amazon
131 | Gemma 4 E4B | 19 | DeepMind
132 | Llama Nemotron Super 49B v1.5 | 19 | Meta
133 | Mistral Small 4 | 19 | Mistral
134 | Llama 3.3 Nemotron Super 49B | 18 | Meta
135 | Llama 4 Maverick | 18 | Facebook AI Research Lab
136 | Sarvam 105B (high) | 18 | Sarvam
137 | Magistral Small 1.2 | 18 | Mistral
138 | Nova 2 Lite | 18 | Amazon
139 | Llama3.1-405B | 17 | Facebook AI Research Lab
140 | EXAONE 4.0 32B | 17 | LG AI Research
141 | Nova 2 Omni(Preview) | 17 | Amazon
142 | Qwen3.5 2B | 16 | Alibaba
143 | Nanbeige4.1-3B | 16 | Nanbeige
144 | Ministral 3 14B | 16 | MistralAI
145 | DeepSeek-R1-Distill-Llama-70B | 16 | DeepSeek-AI
146 | Falcon-H1R-7B | 16 | TII UAE
147 | Ling-flash-2.0 | 16 | InclusionAI
148 | Qwen3-Omni-30B-A3B | 16 | Alibaba
149 | Step3 VL 10B | 15 | StepFun
150 | Gemma 4 E2B | 15 | DeepMind
151 | Llama Nemotron Ultra | 15 | NVIDIA
152 | ERNIE-4.5-300B-A47B | 15 | Baidu
153 | Solar Pro 2 | 15 | Upstage
154 | NVIDIA Nemotron Nano 12B v2 VL | 15 | NVIDIA
155 | Ministral 3 8B | 15 | MistralAI
156 | Gemma 4 E4B | 15 | DeepMind
157 | NVIDIA Nemotron Nano 9B V2 | 15 | NVIDIA
158 | Granite 4.1 30B | 15 | IBM
159 | NVIDIA Nemotron 3 Nano 4B | 15 | NVIDIA
160 | Qwen3.5 2B | 15 | Alibaba
161 | Llama Nemotron Super 49B v1.5 | 15 | Meta
162 | Llama3.3-70B-Instruct | 14 | Facebook AI Research Lab
163 | Llama 3.1 Nemotron Nano 4B v1.1 | 14 | Meta
164 | Kimi Linear 48B A3B Instruct | 14 | Kimi
165 | Llama 3.3 Nemotron Super 49B | 14 | Meta
166 | Ring-flash-2.0 | 14 | InclusionAI
167 | Solar Pro 2 | 14 | Upstage
168 | Llama 4 Scout | 14 | Facebook AI Research Lab
169 | C4AI Command A (202503) | 13 | CohereAI
170 | Llama 3.1 Nemotron 70B | 13 | NVIDIA
171 | NVIDIA Nemotron 3 Nano | 13 | NVIDIA
172 | NVIDIA Nemotron Nano 9B V2 | 13 | NVIDIA
173 | Granite 4.1 8B | 12 | IBM
174 | Sarvam 30B (high) | 12 | Sarvam
175 | Gemma 4 E2B | 12 | DeepMind
176 | R1 1776 | 12 | Perplexity
177 | Llama 3.2-Vision-90B | 12 | Facebook AI Research Lab
178 | EXAONE 4.0 32B | 12 | LG AI Research
179 | Ministral 3 3B | 11 | Mistral
180 | Jamba 1.7 Large | 11 | AI21 Labs
181 | Granite 4.0 H Small | 11 | IBM
182 | Qwen3-Omni-30B-A3B | 11 | Alibaba
183 | Qwen3.5 0.8B | 11 | Alibaba
184 | LFM2 24B A2B | 10 | Liquid AI
185 | Phi 4 - 14B | 10 | Microsoft Azure
186 | Amazon Nova Micro | 10 | Amazon
187 | NVIDIA Nemotron Nano 12B v2 VL | 10 | NVIDIA
188 | Phi-4-multimodal-instruct | 10 | Microsoft Azure
189 | Qwen3.5 0.8B | 10 | Alibaba
190 | Jamba Reasoning 3B | 10 | AI21 Labs
191 | Gemini 3.0 Flash | 10 | Google Deep Mind
192 | Ling-mini-2.0 | 9 | InclusionAI
193 | Llama 3.2-Vision-11B | 9 | Facebook AI Research Lab
194 | Granite 4.1 3B | 9 | IBM
195 | Phi-4-mini-instruct (3.8B) | 8 | Microsoft Azure
196 | Exaone 4.0 1.2B | 8 | LG AI Research
197 | Exaone 4.0 1.2B | 8 | LG AI Research
198 | LFM2.5-1.2B-Thinking | 8 | Liquid AI
199 | Jamba 1.7 Mini | 8 | AI21 Labs
200 | LFM2.5-1.2B-Instruct | 8 | Liquid AI
201 | LFM2 2.6B | 8 | Liquid AI
202 | Granite 4.0 H 1B | 8 | IBM
203 | Gemma 3-270M | 8 | Google Deep Mind
204 | Apertus 70B Instruct | 8 | Swiss AI
205 | Granite 4.0 Micro | 8 | IBM
206 | Granite 4.0 1B | 7 | IBM
207 | LFM2 8B A1B | 7 | Liquid AI
208 | LFM2.5-VL-1.6B | 6 | Liquid AI
209 | Granite 4.0 350M | 6 | IBM
210 | Apertus 8B Instruct | 6 | Swiss AI
211 | Granite 4.0 H 350M | 5 | IBM
212 | Tiny Aya Global | 5 | Cohere

Data is for reference only; official sources take precedence. Links next to model names lead to the DataLearner model detail pages.
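The ranking table numbers its rows sequentially even where the displayed integer scores tie (ranks 3 to 5 all show 57). The page does not state how ties are ordered, and the displayed scores may be rounded. For readers who prefer tied models to share a rank, the following is a minimal sketch of standard competition ranking applied to the first five rows of the table. The function name `competition_ranks` is hypothetical and is not part of DataLearner or Artificial Analysis.

```python
from itertools import groupby

# First five rows of the ranking table: (model, displayed score).
rows = [
    ("GPT-5.5 (xhigh)", 60),
    ("GPT-5.5 (high)", 59),
    ("Opus 4.7 (max)", 57),
    ("Gemini 3.1 Pro Preview", 57),
    ("GPT-5.5 (medium)", 57),
]


def competition_ranks(rows):
    """Return (rank, model, score) tuples where tied scores share a rank.

    Hypothetical helper: uses "1, 2, 3, 3, 3, 6" style competition ranking,
    which is an assumption and not the ordering rule used by the page.
    """
    # sorted() is stable, so tied rows keep their original relative order.
    ordered = sorted(rows, key=lambda r: -r[1])
    ranked = []
    position = 1
    for score, group in groupby(ordered, key=lambda r: r[1]):
        members = list(group)
        for model, _ in members:
            ranked.append((position, model, score))
        # The next distinct score skips past all tied entries.
        position += len(members)
    return ranked


ranked = competition_ranks(rows)
for rank, model, score in ranked:
    print(rank, model, score)
```

With this scheme the three models scoring 57 would all be listed at rank 3, and the next model would start at rank 6.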

Benchmark composition (Intelligence Index v4.0)

The Intelligence Index combines 10 rigorous benchmarks to measure AI model capability broadly and to avoid overfitting to any single dimension.

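  • GDPval-AA: real-world agentic tasks
  • τ²-Bench: agentic tool use
  • Terminal-Bench: agentic coding
  • SciCode: coding
  • AA-LCR: long-context reasoning
  • AA-Omniscience: knowledge and hallucination detection
  • IFBench: instruction following
  • Humanity's Last Exam: reasoning and knowledge
  • GPQA Diamond: scientific reasoning
  • CritPt: physics reasoning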

Frequently asked questions (FAQ)

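What is the Artificial Analysis Intelligence Index?
Artificial Analysis Intelligence Index v4.0 is a composite evaluation index that aggregates 10 challenging evaluations, covering math, science, coding, agentic tasks, and reasoning, to measure AI capability broadly. It is designed to prevent overfitting to a single dimension and to provide one unified score for tracking model progress.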
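How is the Intelligence Index calculated?
The index combines the scores of 10 evaluations: GDPval-AA (real-world agentic tasks), τ²-Bench (tool use), Terminal-Bench Hard (agentic coding), SciCode (coding), AA-LCR (long-context reasoning), AA-Omniscience (knowledge and hallucination detection), IFBench (instruction following), Humanity's Last Exam (reasoning), GPQA Diamond (scientific reasoning), and CritPt (physics reasoning). All tests are run independently by Artificial Analysis on standardized hardware.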
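This page does not say how the ten scores are weighted. As an illustration only, the sketch below combines ten per-benchmark scores into one number using a weighted mean that defaults to equal weights. The equal weighting, the function name `composite_index`, and the placeholder scores are all assumptions, not the published Artificial Analysis method or real results.

```python
# Benchmark names as listed in the answer above.
BENCHMARKS = [
    "GDPval-AA", "τ²-Bench", "Terminal-Bench Hard", "SciCode", "AA-LCR",
    "AA-Omniscience", "IFBench", "Humanity's Last Exam", "GPQA Diamond",
    "CritPt",
]


def composite_index(scores, weights=None):
    """Weighted mean of per-benchmark scores (hypothetical helper).

    ASSUMPTION: equal weights by default; the real weighting is not
    stated on this page.
    """
    missing = [b for b in BENCHMARKS if b not in scores]
    if missing:
        raise ValueError(f"missing benchmark scores: {missing}")
    if weights is None:
        weights = {b: 1.0 for b in BENCHMARKS}
    total_weight = sum(weights[b] for b in BENCHMARKS)
    return sum(scores[b] * weights[b] for b in BENCHMARKS) / total_weight


# Placeholder scores on a 0-100 scale, invented for illustration only.
placeholder = {b: 50.0 for b in BENCHMARKS}
placeholder["SciCode"] = 40.0
placeholder["IFBench"] = 60.0

index = composite_index(placeholder)
print(round(index))
```

Passing a `weights` dictionary lets a reader test how sensitive a composite score is to the weighting choice.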
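How does this differ from the LMArena leaderboard?
LMArena rankings are based on crowdsourced user votes (Elo ratings from blind A/B comparisons) and reflect subjective human preference. The Artificial Analysis Intelligence Index instead scores models with standardized automated benchmarks, measuring technical capability in specific domains. Both are useful: LMArena captures real user experience, while the AA Intelligence Index provides reproducible technical measurements.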
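To make the contrast concrete, here is a minimal sketch of the textbook Elo update applied to a single blind A/B vote. The K-factor of 32 and the starting ratings of 1000 are illustrative assumptions, and LMArena's actual rating procedure may differ from this simple online update.

```python
def expected_score(rating_a, rating_b):
    """Probability that A beats B under the standard Elo logistic model."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400.0))


def elo_update(rating_a, rating_b, outcome_a, k=32.0):
    """Return updated (rating_a, rating_b) after one comparison.

    outcome_a is 1.0 if A wins, 0.0 if B wins, 0.5 for a tie.
    k=32 is an illustrative assumption, not a documented LMArena value.
    """
    e_a = expected_score(rating_a, rating_b)
    e_b = 1.0 - e_a
    new_a = rating_a + k * (outcome_a - e_a)
    new_b = rating_b + k * ((1.0 - outcome_a) - e_b)
    return new_a, new_b


# Two models start level; a user prefers model A in one blind comparison.
a, b = elo_update(1000.0, 1000.0, 1.0)
print(a, b)
```

Each vote moves both ratings by the same amount in opposite directions, so a rating of this kind depends on which opponents a model happened to face, whereas a benchmark score is computed for each model independently.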
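Where can I find the original data?
The original leaderboard and detailed methodology are available at artificialanalysis.ai. The methodology for the Intelligence Index is described on the site's Intelligence Index page.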