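Artificial Analysis Intelligence Index: AI Model Intelligence Leaderboard

Artificial Analysis Intelligence Index v4.0 combines 10 established benchmarks (including GDPval-AA, Terminal-Bench, GPQA Diamond, and SciCode) to evaluate and rank AI models across dimensions such as mathematics, science, coding, and reasoning.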

Top model: Claude Fable 5
Highest score: 60
Number of models: 214
Data version: June 19, 2026

Data source: Artificial Analysis

Full Rankings

Rank | Model | Intelligence Index | Organization
1 | Claude Fable 5 | 60 | Anthropic
2 | Claude Opus 4.8 (max) | 56 | Anthropic
3 | GPT-5.5 (xhigh) | 55 | OpenAI
4 | Opus 4.7 (max) | 54 | Anthropic
5 | GPT-5.5 (high) | 53 | OpenAI
6 | GLM-5.2 (max) | 51 | Zhipu AI
7 | Gemini 3.5 Flash | 50 | Google Deep Mind
8 | Claude Sonnet 4.6 (max) | 47 | Anthropic
9 | GPT-5.5 (medium) | 47 | OpenAI
10 | Gemini 3.1 Pro Preview | 46 | Google Deep Mind
11 | Qwen3.7 Max | 46 | Alibaba
12 | Gemini 3.5 Flash (medium) | 45 | Google
13 | MiniMax-M3 | 44 | MiniMax
14 | DeepSeek-V4-Pro (max) | 44 | DeepSeek-AI
15 | GPT-5.3 Codex (xhigh) | 44 | OpenAI
16 | Muse Spark | 43 | Facebook AI Research
17 | Kimi K2.6 | 43 | Moonshot AI
18 | Opus 4.7 (high) | 43 | Anthropic
19 | MiMo-V2.5-Pro | 42 | Xiaomi
20 | Kimi K2.7 Code | 42 | Kimi
21 | GPT-5.5 (low) | 42 | OpenAI
22 | DeepSeek-V4-Pro (high) | 41 | DeepSeek-AI
23 | DeepSeek-V4-Flash (max) | 40 | DeepSeek-AI
24 | GLM 5.1 | 40 | Zhipu AI
25 | MiMo-V2.5 | 40 | Xiaomi
26 | GPT-5.4 mini (xhigh) | 40 | OpenAI
27 | Qwen 3.6 Plus Preview | 40 | Alibaba
28 | Qwen3.7 Plus | 39 | Alibaba
29 | GPT-5.4 nano (xhigh) | 38 | OpenAI
30 | MiniMax-M2.7 | 38 | MiniMaxAI
31 | GLM-5-Turbo | 38 | Zhipu AI
32 | Nemotron 3 Ultra | 38 | NVIDIA
33 | Grok 4.3 Beta (high) | 38 | xAI
34 | DeepSeek-V4-Flash (high) | 37 | DeepSeek-AI
35 | Qwen3.6-27B | 37 | Alibaba
36 | Nova 2 Omni(Preview) | 36 | Amazon
37 | Grok 4.3 Beta (medium) | 36 | xAI
38 | Claude Sonnet 4.6 (non-reasoning) | 36 | Anthropic
39 | Grok 4.3 Beta (low) | 35 | xAI
40 | GLM 5.1 | 35 | Zhipu AI
41 | MiMo-V2-Omni | 35 | Xiaomi
42 | Gemini 3.5 Flash (minimal) | 35 | Google Deep Mind
43 | Kimi K2.6 | 35 | Moonshot AI
44 | GLM-5V-Turbo | 34 | Zhipu AI
45 | Claude Sonnet 4.6 (Non-reasoning, Low Effort) | 34 | Anthropic
46 | Qwen3.5-397B-A17B | 34 | Alibaba
47 | Hy3 Pre | 34 | Tencent AI Lab
48 | GPT-5.5 Instant (May 2026) | 34 | OpenAI
49 | Gemini 2.0 Flash Experimental | 33 | DeepMind
50 | GPT-5.5 (non-reasoning) | 33 | OpenAI
51 | Qwen3.5-122B-A10B | 32 | Alibaba
52 | Qwen3.5-397B-A17B | 32 | Alibaba
53 | Qwen3.6-35B-A3B | 32 | Alibaba
54 | DeepSeek-V4-Pro | 31 | DeepSeek-AI
55 | Qwen3.5-Omni-Plus | 31 | Alibaba
56 | Ring-2.6-1T | 31 | InclusionAI
57 | OpenAI o3 | 30 | OpenAI
58 | GPT-5.4 nano | 30 | OpenAI
59 | Mistral Medium 3.5 | 30 | MistralAI
60 | GPT-5.4 mini (medium) | 30 | OpenAI
61 | Step 3.7 Flash | 30 | StepFun
62 | Haiku 4.5 | 30 | Anthropic
63 | Gemma 4 31B | 29 | DeepMind
64 | C4AI Command A (202503) | 29 | CohereAI
65 | Qwen3.6-27B | 29 | Alibaba
66 | DeepSeek-V4-Flash | 29 | DeepSeek-AI
67 | JT-35B-Flash | 28 | China Mobile
68 | Qwen3.5-122B-A10B | 28 | Alibaba
69 | MiMo-V2.5-Pro | 28 | Xiaomi
70 | Gemini 2.5-Pro | 27 | Google Deep Mind
71 | Hy3 Pre | 26 | Tencent AI Lab
72 | Ling-2.6-1T | 26 | InclusionAI
73 | Step 3.5 Flash | 26 | StepFunAI
74 | Doubao Seed Code | 26 | ByteDance Seed
75 | Gemma 4 26B A4B | 26 | DeepMind
76 | NVIDIA Nemotron 3 Super | 25 | NVIDIA
77 | Mercury 2 | 25 | Inception
78 | Gemini 3.1 Flash-Lite | 25 | Google
79 | Qwen3.5-9B-Instruct | 25 | Alibaba
80 | Gemma 4 31B | 25 | DeepMind
81 | Grok 4.3 (Non-reasoning) | 25 | xAI
82 | K-EXAONE | 25 | LG AI Research
83 | Trinity Large Thinking | 24 | Arcee AI
84 | Qwen3.6-35B-A3B | 24 | Alibaba
85 | GPT OSS 120B (high) | 24 | OpenAI
86 | Haiku 4.5 | 24 | Anthropic
87 | Qwen3.5-35B-A3B | 23 | Alibaba
88 | MiMo-V2-Flash | 23 | Xiaomi
89 | EXAONE 4.5 33B | 23 | LG AI Research
90 | HyperNova 60B 2605 | 22 | Multiverse Computing
91 | Gemma 4 12B | 22 | Google
92 | ERNIE 5.0 | 22 | Baidu
93 | Nova 2 Pro(Preview) (medium) | 22 | Amazon
94 | Nemotron Cascade 2 30B A3B | 21 | NVIDIA
95 | Qwen3-Coder-Next | 21 | Alibaba
96 | Nova 2 Omni(Preview) (medium) | 21 | Amazon
97 | Mistral Small 4 | 21 | Mistral
98 | North Mini Code | 21 | Cohere
99 | Nova 2 Lite (high) | 21 | Amazon
100 | Qwen3.5-9B-Instruct | 20 | Alibaba
101 | Magistral Medium 1.2 | 20 | Mistral
102 | Gemma 4 26B A4B | 20 | DeepMind
103 | Qwen3.5 4B | 20 | Alibaba
104 | Qwen3-Next | 20 | Alibaba
105 | Nova 2 Pro(Preview) (low) | 20 | Amazon
106 | Ling 2.6 Flash | 19 | InclusionAI
107 | Nova 2 Lite (medium) | 19 | Amazon
108 | Qwen3.5-Omni-Flash | 19 | Alibaba
109 | JT-MINI | 19 | China Mobile
110 | Nova 2 Lite (low) | 18 | Amazon
111 | GPT OSS 120B (low) | 18 | OpenAI
112 | GPT-5.4 nano | 18 | OpenAI
113 | NVIDIA Nemotron 3 Nano | 18 | NVIDIA
114 | LongCat Flash Lite | 17 | LongCat
115 | K-EXAONE | 17 | LG AI Research
116 | GPT-5.4 mini | 17 | OpenAI
117 | Nova 2 Omni(Preview) (low) | 17 | Amazon
118 | Nova 2 Pro(Preview) | 16 | Amazon
119 | Mi:dm K 2.5 Pro | 16 | Korea Telecom
120 | Mistral Large 3 | 16 | MistralAI
121 | Qwen3.5 4B | 16 | Alibaba
122 | INTELLECT-3 | 16 | Prime Intellect
123 | Devstral 2 | 15 | Mistral
124 | Solar Open 100B | 15 | Upstage
125 | Qwen3-Omni-30B-A3B (reasoning) | 15 | Alibaba
126 | GPT OSS 20B (high) | 15 | OpenAI
127 | GPT OSS 20B (low) | 14 | OpenAI
128 | Llama 4 Maverick | 14 | Facebook AI Research
129 | Solar Pro 3 | 14 | Upstage
130 | Qwen3-Next | 14 | Alibaba
131 | Gemma 4 12B (Non-reasoning) | 13 | Google
132 | Devstral Small 2 | 13 | Mistral
133 | Motif-2-12.7B | 13 | Motif Technologies
134 | Nova Premier | 13 | Amazon
135 | Gemma 4 E4B | 12 | DeepMind
136 | Llama Nemotron Super 49B v1.5 | 12 | Meta
137 | Mistral Small 4 | 12 | Mistral
138 | MiniCPM5-1B | 12 | OpenBMB
139 | Magistral Small 1.2 | 12 | Mistral
140 | Sarvam 105B (high) | 12 | Sarvam
141 | Nova 2 Lite | 12 | Amazon
142 | MiniCPM5-1B | 12 | OpenBMB
143 | EXAONE 4.0 32B | 11 | LG AI Research
144 | Nova 2 Omni(Preview) | 11 | Amazon
145 | Qwen3.5 2B | 10 | Alibaba
146 | Nanbeige4.1-3B | 10 | Nanbeige
147 | Llama 4 Scout | 10 | Facebook AI Research
148 | Ministral 3 14B | 10 | MistralAI
149 | Falcon-H1R-7B | 10 | TII UAE
150 | Qwen3-Omni-30B-A3B | 10 | Alibaba
151 | Step3 VL 10B | 9 | StepFun
152 | Gemma 4 E2B | 9 | DeepMind
153 | Llama Nemotron Ultra | 9 | NVIDIA
154 | ERNIE-4.5-300B-A47B | 9 | Baidu
155 | Solar Pro 2 | 9 | Upstage
156 | NVIDIA Nemotron Nano 12B v2 VL | 9 | NVIDIA
157 | Ministral 3 8B | 9 | MistralAI
158 | Gemma 4 E4B | 9 | DeepMind
159 | Granite 4.1 30B | 9 | IBM
160 | NVIDIA Nemotron Nano 9B V2 | 9 | NVIDIA
161 | NVIDIA Nemotron 3 Nano 4B | 9 | NVIDIA
162 | Qwen3.5 2B | 9 | Alibaba
163 | Llama Nemotron Super 49B v1.5 | 9 | Meta
164 | Llama3.3-70B-Instruct | 9 | Facebook AI Research
165 | Kimi Linear 48B A3B Instruct | 9 | Kimi
166 | Llama3.1-405B | 9 | Facebook AI Research
167 | LFM2.5-8B-A1B | 8 | Liquid AI
168 | Ring-flash-2.0 | 8 | InclusionAI
169 | Solar Pro 2 | 8 | Upstage
170 | C4AI Command A (202503) | 8 | CohereAI
171 | Llama 3.1 Nemotron 70B | 8 | NVIDIA
172 | NVIDIA Nemotron 3 Nano | 7 | NVIDIA
173 | NVIDIA Nemotron Nano 9B V2 | 7 | NVIDIA
174 | Granite 4.1 8B | 7 | IBM
175 | Sarvam 30B (high) | 7 | Sarvam
176 | Gemma 4 E2B | 6 | DeepMind
177 | R1 1776 | 6 | Perplexity
178 | Llama 3.2-Vision-90B | 6 | Facebook AI Research
179 | EXAONE 4.0 32B | 6 | LG AI Research
180 | Ministral 3 3B | 6 | Mistral
181 | Jamba 1.7 Large | 5 | AI21 Labs
182 | Granite 4.0 H Small | 5 | IBM
183 | Qwen3-Omni-30B-A3B | 5 | Alibaba
184 | Qwen3.5 0.8B | 5 | Alibaba
185 | LFM2 24B A2B | 5 | Liquid AI
186 | Phi 4 - 14B | 5 | Microsoft Azure
187 | Amazon Nova Micro | 5 | Amazon
188 | NVIDIA Nemotron Nano 12B v2 VL | 5 | NVIDIA
189 | Phi-4-multimodal-instruct | 5 | Microsoft Azure
190 | Qwen3.5 0.8B | 4 | Alibaba
191 | MiniCPM-V 4.6 1.3B | 4 | OpenBMB
192 | Jamba Reasoning 3B | 4 | AI21 Labs
193 | Gemini 3.0 Flash | 4 | Google Deep Mind
194 | Ling-mini-2.0 | 4 | InclusionAI
195 | Llama 3.2-Vision-11B | 3 | Facebook AI Research
196 | Granite 4.1 3B | 3 | IBM
197 | Phi-4-mini-instruct (3.8B) | 3 | Microsoft Azure
198 | Exaone 4.0 1.2B | 3 | LG AI Research
199 | Exaone 4.0 1.2B | 3 | LG AI Research
200 | LFM2.5-1.2B-Thinking | 3 | Liquid AI
201 | Jamba 1.7 Mini | 3 | AI21 Labs
202 | LFM2 2.6B | 3 | Liquid AI
203 | LFM2.5-1.2B-Instruct | 3 | Liquid AI
204 | Granite 4.0 H 1B | 3 | IBM
205 | Gemma 3-270M | 2 | Google Deep Mind
206 | Apertus 70B Instruct | 2 | Swiss AI
207 | Granite 4.0 Micro | 2 | IBM
208 | Granite 4.0 1B | 2 | IBM
209 | LFM2 8B A1B | 2 | Liquid AI
210 | LFM2.5-VL-1.6B | 1 | Liquid AI
211 | Granite 4.0 350M | 1 | IBM
212 | Tiny Aya Global | 1 | Cohere
213 | Apertus 8B Instruct | 1 | Swiss AI
214 | Granite 4.0 H 350M | 1 | IBM

Data is for reference only; official sources take precedence. Links next to model names lead to the DataLearner model detail pages.

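Benchmark Composition (Intelligence Index v4.0)

The Intelligence Index combines 10 rigorous benchmarks to measure AI model capability broadly and to avoid overfitting to any single dimension.

GDPval-AA: real-world agentic tasks
τ²-Bench: agentic tool use
Terminal-Bench: agentic coding
SciCode: coding
AA-LCR: long-context reasoning
AA-Omniscience: knowledge and hallucination detection
IFBench: instruction following
Humanity's Last Exam: reasoning and knowledge
GPQA Diamond: scientific reasoning
CritPt: physics reasoning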

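Frequently Asked Questions (FAQ)

What is the Artificial Analysis Intelligence Index?
Artificial Analysis Intelligence Index v4.0 is a composite evaluation index that aggregates 10 challenging evaluations, covering mathematics, science, coding, agentic tasks, and reasoning, to measure AI capability comprehensively. It is designed to prevent overfitting to a single dimension and to provide one unified score for tracking model progress.
How is the Intelligence Index calculated?
The index combines the scores of 10 evaluations: GDPval-AA (real-world agentic tasks), τ²-Bench (tool use), Terminal-Bench Hard (agentic coding), SciCode (coding), AA-LCR (long-context reasoning), AA-Omniscience (knowledge and hallucination detection), IFBench (instruction following), Humanity's Last Exam (reasoning), GPQA Diamond (scientific reasoning), and CritPt (physics reasoning). All tests are run independently by Artificial Analysis on standardized hardware.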
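As a rough illustration of how ten benchmark scores could be folded into one number, the sketch below computes a weighted average. The page does not state the actual weights or normalization used by Artificial Analysis, so the equal weighting, the 0-100 scale, the function name `composite_index`, and every score value here are assumptions made purely for illustration.

```python
# Hypothetical per-benchmark scores on an assumed common 0-100 scale.
# These numbers are invented for the example and are not real results.
scores = {
    "GDPval-AA": 40.0,
    "tau2-Bench": 80.0,
    "Terminal-Bench Hard": 45.0,
    "SciCode": 50.0,
    "AA-LCR": 70.0,
    "AA-Omniscience": 30.0,
    "IFBench": 75.0,
    "Humanity's Last Exam": 35.0,
    "GPQA Diamond": 90.0,
    "CritPt": 15.0,
}


def composite_index(benchmark_scores, weights=None):
    """Return a weighted average of benchmark scores.

    If no weights are given, every benchmark counts equally
    (an assumption; the real index may weight them differently).
    """
    if not benchmark_scores:
        raise ValueError("at least one benchmark score is required")
    if weights is None:
        weights = {name: 1.0 for name in benchmark_scores}
    total_weight = sum(weights[name] for name in benchmark_scores)
    if total_weight <= 0:
        raise ValueError("weights must sum to a positive value")
    weighted_sum = sum(
        benchmark_scores[name] * weights[name] for name in benchmark_scores
    )
    return weighted_sum / total_weight


index = composite_index(scores)
print(f"Composite index: {round(index)}")  # prints "Composite index: 53"
```

Passing a `weights` dictionary lets the same function model category-level weighting, should the published methodology call for it.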
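How does this differ from the LMArena leaderboard?
LMArena rankings are based on crowdsourced user votes (Elo ratings from blind A/B comparisons) and reflect subjective human preference. The Artificial Analysis Intelligence Index instead uses standardized automated benchmarks for objective scoring, measuring technical capability in specific domains. Both are valuable: LMArena captures real user experience, while the AA Intelligence Index provides reproducible technical measurement.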
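To make the contrast with vote-based ranking concrete, here is a minimal sketch of the classic Elo update applied to one blind A/B comparison. It is the textbook formula, not LMArena's actual pipeline; the K-factor of 32, the 400-point scale, the starting rating of 1500, and the function names are conventional defaults assumed for this example.

```python
def expected_score(rating_a, rating_b):
    """Probability that A beats B under the standard Elo logistic model."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400.0))


def update_elo(rating_a, rating_b, outcome_a, k=32.0):
    """Return updated (rating_a, rating_b) after one comparison.

    outcome_a is 1.0 if A won the vote, 0.0 if B won, 0.5 for a tie.
    k controls how far a single vote moves the ratings (assumed value).
    """
    exp_a = expected_score(rating_a, rating_b)
    exp_b = 1.0 - exp_a
    new_a = rating_a + k * (outcome_a - exp_a)
    new_b = rating_b + k * ((1.0 - outcome_a) - exp_b)
    return new_a, new_b


# Two models start level; a user prefers model A's answer.
print(update_elo(1500.0, 1500.0, outcome_a=1.0))  # prints (1516.0, 1484.0)
```

Because each rating depends on which opponents a model happened to face and how users voted, such scores shift as votes accumulate, whereas a benchmark-based index is fixed once the test suite has been run.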
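Where can I find the original data?
The original leaderboard and detailed methodology are available at artificialanalysis.ai. The methodology for the Intelligence Index is described on the Intelligence Index page.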