DataLearner AI

A knowledge platform focused on LLM benchmarking, datasets, and practical instruction with continuously updated capability maps.

© 2026 DataLearner AI. DataLearner curates industry data and case studies so researchers, enterprises, and developers can rely on trustworthy intelligence.


Kimi K2.5 vs Kimi K2

Across 9 shared benchmarks, Kimi K2.5 leads overall: Kimi K2.5 wins 8, Kimi K2 wins 1, with 0 ties and an average score difference of +25.61.

Kimi K2.5
Moonshot AI · 2026-01-27 · Multimodal model

Kimi K2
Moonshot AI · 2025-07-11 · AI model

Kimi K2.5: 8 wins (89%) · Kimi K2: 1 win (11%)

Benchmark scores

Grouped by capability, sorted by largest gap within each. 9 shared benchmarks.

General Knowledge

Kimi K2.5 leads 3/4

Benchmark | Kimi K2.5 | Rank | Mode | Kimi K2 | Rank | Diff
ARC-AGI | 65.30 | 31 / 65 | Thinking (No Tools) | 13.30 | 57 / 65 | +52
HLE | 50.20 | 17 / 149 | Thinking (With Tools) | 4.70 | 146 / 149 | +45.50
GPQA Diamond | 87.60 | 31 / 175 | Thinking (No Tools) | 75.10 | 90 / 175 | +12.50
MMLU Pro | 78.50 | 64 / 124 | Thinking (No Tools) | 81.10 | 51 / 124 | -2.60

Math and Reasoning

Kimi K2.5 leads 3/3

Benchmark | Kimi K2.5 | Rank | Mode | Kimi K2 | Rank | Diff
AIME2025 | 96.10 | 21 / 106 | Thinking (No Tools) | 54 | 85 / 106 | +42.10
Simple Bench | 46.80 | 13 / 27 | Thinking (No Tools) | 26.30 | 24 / 27 | +20.50
FrontierMath - Tier 4 | 4.20 | 40 / 80 | Normal (No Tools) | 0.01 | 71 / 80 | +4.19

Coding and Software Engineering

Kimi K2.5 leads 2/2

Benchmark | Kimi K2.5 | Rank | Mode | Kimi K2 | Rank | Diff
LiveCodeBench | 85 | 14 / 118 | Thinking (No Tools) | 53.70 | 84 / 118 | +31.30
SWE-bench Verified | 76.80 | 22 / 103 | Thinking (With Tools) | 51.80 | 83 / 103 | +25

Specs

Field | Kimi K2.5 | Kimi K2
Publisher | Moonshot AI | Moonshot AI
Release date | 2026-01-27 | 2025-07-11
Model type | Multimodal model | AI model
Architecture | MoE | MoE
Parameters | 10000.0 | 10000.0
Context length | 256K | 131K
Max output | 16384 | 134144

API pricing

Prices use DataLearner records when available; missing fields are not inferred.

Item | Kimi K2.5 | Kimi K2
Text input | 0.6 USD / 1M tokens | 0.6 USD / 1M tokens
Text output | 3 USD / 1M tokens | 2.5 USD / 1M tokens
Cache read | 0.1 USD / 1M tokens | Not public
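
To make the per-million-token prices concrete, here is a minimal Python sketch that estimates the cost of a request mix from the text input and text output prices listed above. The function name `estimate_cost` and the example token counts are hypothetical, chosen only for illustration. Cache-read pricing is left out because it is not public for Kimi K2.

```python
# Prices copied from the API pricing table on this page, in USD per 1M tokens.
# Cache-read pricing is omitted: it is listed as "Not public" for Kimi K2.
PRICES_USD_PER_MILLION = {
    "Kimi K2.5": {"input": 0.6, "output": 3.0},
    "Kimi K2": {"input": 0.6, "output": 2.5},
}


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated cost in USD for the given token counts."""
    price = PRICES_USD_PER_MILLION[model]
    total = input_tokens * price["input"] + output_tokens * price["output"]
    return total / 1_000_000


if __name__ == "__main__":
    # Hypothetical workload: 2M input tokens and 500K output tokens.
    for model in PRICES_USD_PER_MILLION:
        cost = estimate_cost(model, 2_000_000, 500_000)
        print(f"{model}: {cost:.2f} USD")
```

Because the input price is the same for both models, any cost difference in this sketch comes entirely from the output-token share of the workload.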

Summary

  • Kimi K2.5 leads in: General Knowledge (3/4), Math and Reasoning (3/3), Coding and Software Engineering (2/2)

On average across the 9 shared benchmarks, Kimi K2.5 scores 25.61 higher.

Largest single-benchmark gap: ARC-AGI — Kimi K2.5 65.30 vs Kimi K2 13.30 (+52).
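
The summary figures can be rechecked from the nine score pairs in the benchmark tables. The sketch below is one way to do that, not the site's own pipeline. It assumes a win means a strictly higher score and that the average is the plain mean of the per-benchmark differences. The names `SCORES` and `summarize` are hypothetical.

```python
# Scores copied from the benchmark tables on this page: (Kimi K2.5, Kimi K2).
SCORES = {
    "ARC-AGI": (65.30, 13.30),
    "HLE": (50.20, 4.70),
    "GPQA Diamond": (87.60, 75.10),
    "MMLU Pro": (78.50, 81.10),
    "AIME2025": (96.10, 54.0),
    "Simple Bench": (46.80, 26.30),
    "FrontierMath - Tier 4": (4.20, 0.01),
    "LiveCodeBench": (85.0, 53.70),
    "SWE-bench Verified": (76.80, 51.80),
}


def summarize(scores):
    """Compute win counts, mean difference, and the largest gap.

    Assumption: a "win" is a strictly higher score, and the average
    difference is the unweighted mean of (Kimi K2.5 - Kimi K2).
    """
    diffs = {name: a - b for name, (a, b) in scores.items()}
    wins = sum(d > 0 for d in diffs.values())
    losses = sum(d < 0 for d in diffs.values())
    ties = len(diffs) - wins - losses
    mean_diff = sum(diffs.values()) / len(diffs)
    largest = max(diffs, key=lambda name: abs(diffs[name]))
    return {
        "wins": wins,
        "losses": losses,
        "ties": ties,
        "mean_diff": mean_diff,
        "largest_gap": largest,
    }


if __name__ == "__main__":
    result = summarize(SCORES)
    print(f"Kimi K2.5 wins: {result['wins']}, Kimi K2 wins: {result['losses']}, "
          f"ties: {result['ties']}")
    print(f"Average difference: {result['mean_diff']:+.2f}")
    print(f"Largest gap: {result['largest_gap']}")
```

Under these assumptions the sketch should reproduce the figures stated on this page: 8 wins to 1 with no ties, an average difference of +25.61, and ARC-AGI as the largest gap.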

Page generated from structured model, pricing and benchmark records. No real-time LLM is used to write the prose.
