Artificial Analysis Intelligence Index

Artificial Analysis Intelligence Index aggregates multiple rigorous benchmarks to compare AI model intelligence across coding, reasoning, science, tool use, and agentic tasks.

Top Model

Claude Fable 5

Top Score

60

Model Count

216

Data version

2026年06月28日

Data source: Artificial Analysis

Origin:AllChina
Leaderboard snapshot month:

Ranking Table

RankModelIntelligence IndexOrganization
AnthropicClaude Fable 5Anthropic60Anthropic
AnthropicClaude Opus 4.8 (max)Anthropic56Anthropic
OpenAIGPT-5.5 (xhigh)OpenAI55OpenAI
4AnthropicOpus 4.7 (max)Anthropic54Anthropic
5OpenAIGPT-5.5 (high)OpenAI53OpenAI
6GLM-5.2 (max)智谱AI51智谱AI
7OpenAIGPT-5.5 (medium)OpenAI50OpenAI
8Google Deep MindGemini 3.5 FlashGoogle Deep Mind50Google Deep Mind
9AnthropicClaude Sonnet 4.6 (max)Anthropic47Anthropic
10Google Deep MindGemini 3.1 Pro PreviewGoogle Deep Mind46Google Deep Mind
11Qwen3.7 Max阿里巴巴46阿里巴巴
12GoogleGemini 3.5 Flash (medium)Google45Google
13MiniMaxMiniMax-M3MiniMax44MiniMax
14DeepSeek-AIDeepSeek-V4-Pro (max)DeepSeek-AI44DeepSeek-AI
15OpenAIGPT-5.3 Codex (xhigh)OpenAI44OpenAI
16OpenAIGPT-5.5 (low)OpenAI43OpenAI
17Muse SparkFacebook AI研究实验室43Facebook AI研究实验室
18Moonshot AIKimi K2.6Moonshot AI43Moonshot AI
19AnthropicOpus 4.7 (high)Anthropic43Anthropic
20MiMo-V2.5-ProXiaomi42Xiaomi
21KimiKimi K2.7 CodeKimi42Kimi
22Nex-N2-ProNex AGI41Nex AGI
23DeepSeek-AIDeepSeek-V4-Pro (high)DeepSeek-AI41DeepSeek-AI
24DeepSeek-AIDeepSeek-V4-Flash (max)DeepSeek-AI40DeepSeek-AI
25GLM 5.1智谱AI40智谱AI
26MiMo-V2.5Xiaomi40Xiaomi
27OpenAIGPT-5.4 mini (xhigh)OpenAI40OpenAI
28xAIGrok Build 0.1 0616xAI40xAI
29Qwen 3.6 Plus Preview阿里巴巴40阿里巴巴
30AlibabaQwen3.7 PlusAlibaba39Alibaba
31OpenAIGPT-5.4 nano (xhigh)OpenAI38OpenAI
32MiniMaxAIMiniMax-M2.7MiniMaxAI38MiniMaxAI
33GLM-5-Turbo智谱AI38智谱AI
34NVIDIANemotron 3 UltraNVIDIA38NVIDIA
35xAIGrok 4.3 Beta (high)xAI38xAI
36DeepSeek-AIDeepSeek-V4-Flash (high)DeepSeek-AI37DeepSeek-AI
37Qwen3.6-27B阿里巴巴37阿里巴巴
38Nova 2 Omni(Preview)亚马逊36亚马逊
39xAIGrok 4.3 Beta (medium)xAI36xAI
40AnthropicClaude Sonnet 4.6 (non-reasoning)Anthropic36Anthropic
41xAIGrok 4.3 Beta (low)xAI35xAI
42OpenAIGPT-5.5 (non-reasoning)OpenAI35OpenAI
43GLM 5.1智谱AI35智谱AI
44MiMo-V2-OmniXiaomi35Xiaomi
45Google Deep MindGemini 3.5 Flash (minimal)Google Deep Mind35Google Deep Mind
46Moonshot AIKimi K2.6Moonshot AI35Moonshot AI
47GLM-5V-Turbo智谱AI34智谱AI
48AnthropicClaude Sonnet 4.6 (Non-reasoning, Low Effort)Anthropic34Anthropic
49Qwen3.5-397B-A17B阿里巴巴34阿里巴巴
50Hy3 Pre腾讯AI实验室34腾讯AI实验室
51OpenAIGPT-5.5 Instant (May 2026)OpenAI34OpenAI
52DeepMindGemini 2.0 Flash ExperimentalDeepMind33DeepMind
53Qwen3.5-122B-A10B阿里巴巴32阿里巴巴
54Qwen3.5-397B-A17B阿里巴巴32阿里巴巴
55Qwen3.6-35B-A3B阿里巴巴32阿里巴巴
56DeepSeek-AIDeepSeek-V4-ProDeepSeek-AI31DeepSeek-AI
57Qwen3.5-Omni-Plus阿里巴巴31阿里巴巴
58Ring-2.6-1TInclusionAI31InclusionAI
59OpenAIOpenAI o3OpenAI30OpenAI
60OpenAIGPT-5.4 nanoOpenAI30OpenAI
61MistralAIMistral Medium 3.5MistralAI30MistralAI
62OpenAIGPT-5.4 mini (medium)OpenAI30OpenAI
63StepFunStep 3.7 FlashStepFun30StepFun
64AnthropicHaiku 4.5Anthropic30Anthropic
65DeepMindGemma 4 31BDeepMind29DeepMind
66CohereAIC4AI Command A (202503)CohereAI29CohereAI
67Qwen3.6-27B阿里巴巴29阿里巴巴
68DeepSeek-AIDeepSeek-V4-FlashDeepSeek-AI29DeepSeek-AI
69JT-35B-FlashChina Mobile28China Mobile
70Qwen3.5-122B-A10B阿里巴巴28阿里巴巴
71MiMo-V2.5-ProXiaomi28Xiaomi
72Hy3 Pre腾讯AI实验室26腾讯AI实验室
73Ling-2.6-1TInclusionAI26InclusionAI
74StepFunAIStep 3.5 FlashStepFunAI26StepFunAI
75ByteDance SeedDoubao Seed CodeByteDance Seed26ByteDance Seed
76Google Deep MindGemini 2.5-ProGoogle Deep Mind26Google Deep Mind
77DeepMindGemma 4 26B A4BDeepMind26DeepMind
78NVIDIANVIDIA Nemotron 3 SuperNVIDIA25NVIDIA
79Mercury 2Inception25Inception
80GoogleGemini 3.1 Flash-LiteGoogle25Google
81Qwen3.5-9B-Instruct阿里巴巴25阿里巴巴
82DeepMindGemma 4 31BDeepMind25DeepMind
83xAIGrok 4.3 (Non-reasoning)xAI25xAI
84K-EXAONELG AI Research25LG AI Research
85MiMo-V2-FlashXiaomi25Xiaomi
86Trinity Large ThinkingArcee AI24Arcee AI
87Qwen3.6-35B-A3B阿里巴巴24阿里巴巴
88OpenAIGPT OSS 120B (high)OpenAI24OpenAI
89AnthropicHaiku 4.5Anthropic24Anthropic
90Qwen3.5-35B-A3B阿里巴巴23阿里巴巴
91EXAONE 4.5 33BLG AI Research23LG AI Research
92HyperNova 60B 2605Multiverse Computing22Multiverse Computing
93GoogleGemma 4 12BGoogle22Google
94ERNIE 5.0百度22百度
95Nova 2 Pro(Preview) (medium)亚马逊22亚马逊
96NVIDIANemotron Cascade 2 30B A3BNVIDIA21NVIDIA
97Qwen3-Coder-Next阿里巴巴21阿里巴巴
98Nova 2 Omni(Preview) (medium)亚马逊21亚马逊
99MistralMistral Small 4Mistral21Mistral
100CohereNorth Mini CodeCohere21Cohere
101Qwen3.5-9B-Instruct阿里巴巴20阿里巴巴
102DeepMindGemma 4 26B A4BDeepMind20DeepMind
103AlibabaQwen3.5 4BAlibaba20Alibaba
104Qwen3-Next阿里巴巴20阿里巴巴
105Nova 2 Pro(Preview) (low)亚马逊20亚马逊
106Ling 2.6 FlashInclusionAI19InclusionAI
107MistralDevstral 2Mistral19Mistral
108Nova 2 Lite (medium)亚马逊19亚马逊
109Qwen3.5-Omni-Flash阿里巴巴19阿里巴巴
110JT-MINIChina Mobile19China Mobile
111Nova 2 Lite (high)亚马逊18亚马逊
112MistralMagistral Medium 1.2Mistral18Mistral
113Nova 2 Lite (low)亚马逊18亚马逊
114OpenAIGPT OSS 120B (low)OpenAI18OpenAI
115OpenAIGPT-5.4 nanoOpenAI18OpenAI
116LongCat Flash LiteLongCat17LongCat
117K-EXAONELG AI Research17LG AI Research
118OpenAIGPT-5.4 miniOpenAI17OpenAI
119Nova 2 Omni(Preview) (low)亚马逊17亚马逊
120Mi:dm K 2.5 ProKorea Telecom16Korea Telecom
121AlibabaQwen3.5 4BAlibaba16Alibaba
122MistralAIMistral Large 3MistralAI16MistralAI
123INTELLECT-3Prime Intellect16Prime Intellect
124Solar Open 100BUpstage15Upstage
125Qwen3-Omni-30B-A3B (reasoning)阿里巴巴15阿里巴巴
126OpenAIGPT OSS 20B (high)OpenAI15OpenAI
127Nova 2 Pro(Preview)亚马逊14亚马逊
128OpenAIGPT OSS 20B (low)OpenAI14OpenAI
129Llama 4 MaverickFacebook AI研究实验室14Facebook AI研究实验室
130NVIDIANVIDIA Nemotron 3 NanoNVIDIA14NVIDIA
131Solar Pro 3Upstage14Upstage
132Qwen3-Next阿里巴巴14阿里巴巴
133GoogleGemma 4 12B (Non-reasoning)Google13Google
134MistralDevstral Small 2Mistral13Mistral
135Motif-2-12.7BMotif Technologies13Motif Technologies
136AmazonNova PremierAmazon13Amazon
137DeepMindGemma 4 E4BDeepMind12DeepMind
138MetaLlama Nemotron Super 49B v1.5Meta12Meta
139MistralMistral Small 4Mistral12Mistral
140MiniCPM5-1BOpenBMB12OpenBMB
141MistralMagistral Small 1.2Mistral12Mistral
142Sarvam 105B (high)Sarvam12Sarvam
143Nova 2 Lite亚马逊12亚马逊
144MiniCPM5-1BOpenBMB12OpenBMB
145MistralAIMinistral 3 14BMistralAI11MistralAI
146EXAONE 4.0 32BLG AI Research11LG AI Research
147Nova 2 Omni(Preview)亚马逊11亚马逊
148AlibabaQwen3.5 2BAlibaba10Alibaba
149Nanbeige4.1-3BNanbeige10Nanbeige
150Llama 4 ScoutFacebook AI研究实验室10Facebook AI研究实验室
151Falcon-H1R-7BTII UAE10TII UAE
152Qwen3-Omni-30B-A3B阿里巴巴10阿里巴巴
153StepFunStep3 VL 10BStepFun9StepFun
154DeepMindGemma 4 E2BDeepMind9DeepMind
155NVIDIALlama Nemotron UltraNVIDIA9NVIDIA
156ERNIE-4.5-300B-A47B百度9百度
157Solar Pro 2Upstage9Upstage
158NVIDIANVIDIA Nemotron Nano 12B v2 VLNVIDIA9NVIDIA
159MistralAIMinistral 3 8BMistralAI9MistralAI
160DeepMindGemma 4 E4BDeepMind9DeepMind
161Granite 4.1 30BIBM9IBM
162NVIDIANVIDIA Nemotron Nano 9B V2NVIDIA9NVIDIA
163NVIDIANVIDIA Nemotron 3 Nano 4BNVIDIA9NVIDIA
164AlibabaQwen3.5 2BAlibaba9Alibaba
165MetaLlama Nemotron Super 49B v1.5Meta9Meta
166Llama3.3-70B-InstructFacebook AI研究实验室9Facebook AI研究实验室
167KimiKimi Linear 48B A3B InstructKimi9Kimi
168Llama3.1-405BFacebook AI研究实验室9Facebook AI研究实验室
169LFM2.5-8B-A1BLiquid AI8Liquid AI
170Ring-flash-2.0InclusionAI8InclusionAI
171Solar Pro 2Upstage8Upstage
172CohereAIC4AI Command A (202503)CohereAI8CohereAI
173NVIDIALlama 3.1 Nemotron 70BNVIDIA8NVIDIA
174NVIDIANVIDIA Nemotron 3 NanoNVIDIA7NVIDIA
175NVIDIANVIDIA Nemotron Nano 9B V2NVIDIA7NVIDIA
176MistralMinistral 3 3BMistral7Mistral
177Granite 4.1 8BIBM7IBM
178Sarvam 30B (high)Sarvam7Sarvam
179DeepMindGemma 4 E2BDeepMind6DeepMind
180PerplexityR1 1776Perplexity6Perplexity
181Llama 3.2-Vision-90BFacebook AI研究实验室6Facebook AI研究实验室
182EXAONE 4.0 32BLG AI Research6LG AI Research
183Jamba 1.7 LargeAI21 Labs5AI21 Labs
184Granite 4.0 H SmallIBM5IBM
185Qwen3-Omni-30B-A3B阿里巴巴5阿里巴巴
186AlibabaQwen3.5 0.8BAlibaba5Alibaba
187LFM2 24B A2BLiquid AI5Liquid AI
188Microsoft AzurePhi 4 - 14BMicrosoft Azure5Microsoft Azure
189Amazon Nova Micro亚马逊5亚马逊
190NVIDIANVIDIA Nemotron Nano 12B v2 VLNVIDIA5NVIDIA
191Microsoft AzurePhi-4-multimodal-instruct Microsoft Azure5Microsoft Azure
192AlibabaQwen3.5 0.8BAlibaba4Alibaba
193MiniCPM-V 4.6 1.3BOpenBMB4OpenBMB
194Jamba Reasoning 3BAI21 Labs4AI21 Labs
195Google Deep MindGemini 3.0 FlashGoogle Deep Mind4Google Deep Mind
196Ling-mini-2.0InclusionAI4InclusionAI
197Llama 3.2-Vision-11BFacebook AI研究实验室3Facebook AI研究实验室
198Granite 4.1 3BIBM3IBM
199Microsoft AzurePhi-4-mini-instruct (3.8B)Microsoft Azure3Microsoft Azure
200Exaone 4.0 1.2BLG AI Research3LG AI Research
201Exaone 4.0 1.2BLG AI Research3LG AI Research
202LFM2.5-1.2B-ThinkingLiquid AI3Liquid AI
203Jamba 1.7 MiniAI21 Labs3AI21 Labs
204LFM2 2.6BLiquid AI3Liquid AI
205LFM2.5-1.2B-InstructLiquid AI3Liquid AI
206Granite 4.0 H 1BIBM3IBM
207Google Deep MindGemma 3-270MGoogle Deep Mind2Google Deep Mind
208Apertus 70B InstructSwiss AI2Swiss AI
209Granite 4.0 MicroIBM2IBM
210Granite 4.0 1BIBM2IBM
211LFM2 8B A1BLiquid AI2Liquid AI
212LFM2.5-VL-1.6BLiquid AI1Liquid AI
213Granite 4.0 350MIBM1IBM
214CohereTiny Aya GlobalCohere1Cohere
215Apertus 8B InstructSwiss AI1Swiss AI
216Granite 4.0 H 350MIBM1IBM

Data is for reference only. Official sources are authoritative. Click model names to view DataLearner model profiles.

Benchmark Components (Intelligence Index v4.0)

The Intelligence Index aggregates 10 rigorous benchmarks to provide a holistic measure of AI capabilities, preventing narrow specialization.

GDPval-AA
Agentic real-world tasks
τ²-Bench
Agentic tool use
Terminal-Bench
Agentic coding
SciCode
Coding proficiency
AA-LCR
Long context reasoning
AA-Omniscience
Knowledge & hallucination
IFBench
Instruction following
Humanity's Last Exam
Reasoning & knowledge
GPQA Diamond
Scientific reasoning
CritPt
Physics reasoning

FAQ

What is the Artificial Analysis Intelligence Index?
The Artificial Analysis Intelligence Index v4.0 is a composite benchmark that aggregates performance across 10 challenging evaluations — spanning mathematics, science, coding, agentic tasks, and reasoning — to measure AI capabilities holistically. It is designed to prevent narrow specialization and provide a single score for tracking progress.
How is the Intelligence Index calculated?
The index aggregates scores from 10 benchmarks: GDPval-AA (agentic real-world tasks), τ²-Bench (tool use), Terminal-Bench Hard (agentic coding), SciCode (coding), AA-LCR (long context reasoning), AA-Omniscience (knowledge & hallucination), IFBench (instruction following), Humanity's Last Exam (reasoning), GPQA Diamond (scientific reasoning), and CritPt (physics). All tests are independently run by Artificial Analysis on standardized hardware.
How does this differ from LMArena?
LMArena rankings are based on crowdsourced user votes (Elo ratings from blind A/B tests), reflecting subjective human preferences. The Artificial Analysis Intelligence Index uses standardized automated benchmarks with objective scoring, measuring technical capabilities across specific domains. Both perspectives are valuable — LMArena captures real-world user experience, while AA Intelligence Index provides reproducible technical measurements.
Where can I find the original data?
The original leaderboard and detailed methodology are available at artificialanalysis.ai. The Intelligence Index methodology is documented at Intelligence Index page.