MiniMax M2.5vsKimi K2.5

在 13 个共同 benchmark 中,MiniMax M2.5 整体领先:MiniMax M2.5 领先 7 项,Kimi K2.5 领先 6 项,持平 0 项,平均分差 -0.99。

MiniMaxAI
MiniMax M2.5

MiniMaxAI · 2026-02-12 · 推理大模型

Moonshot AI
Kimi K2.5

Moonshot AI · 2026-01-27 · 多模态大模型

MiniMax M2.57 (54%)(46%)6 Kimi K2.5

评测分数

按能力类目分组,每组内按分差大小排列;共 13 项。

General Knowledge

Kimi K2.5 领先 4/4
评测项MiniMax M2.5Kimi K2.5分差
HLE19.40106 / 157Thinking (No Tools)50.2020 / 157Thinking (With Tools)-30.80
ARC-AGI-24.9044 / 59Thinking (No Tools)11.8036 / 59Thinking (No Tools)-6.90
GPQA Diamond85.2048 / 178Thinking (No Tools)87.6034 / 178Thinking (No Tools)-2.40
ARC-AGI63.7032 / 65Thinking (No Tools)65.3031 / 65Thinking (No Tools)-1.60

Claw-style Agent Evaluation

MiniMax M2.5 领先 2/2
评测项MiniMax M2.5Kimi K2.5分差
Claw Bench92.104 / 29Thinking (With Tools)81.7018 / 29Thinking (With Tools)+10.40
Pinch Bench87.806 / 37Thinking (With Tools)84.8017 / 37Thinking (With Tools)+3

Coding and Software Engineer

MiniMax M2.5 领先 2/2
评测项MiniMax M2.5Kimi K2.5分差
SWE-Bench Pro - Public55.4018 / 4350.7031 / 43Thinking (With Tools)+4.70
SWE-bench Verified80.2013 / 10876.8027 / 108Thinking (With Tools)+3.40

AI Agent - Information Search

MiniMax M2.5 领先 1/1
评测项MiniMax M2.5Kimi K2.5分差
BrowseComp76.3018 / 4560.6029 / 45Thinking (With Tools + Internet)+15.70

AI Agent - Tool Usage

MiniMax M2.5 领先 1/1
评测项MiniMax M2.5Kimi K2.5分差
Terminal Bench 2.051.7030 / 4650.8033 / 46Thinking (With Tools)+0.90

Long Context

MiniMax M2.5 领先 1/1
评测项MiniMax M2.5Kimi K2.5分差
AA-LCR69.503 / 13Thinking (No Tools)6510 / 13Thinking (No Tools)+4.50

Math and Reasoning

Kimi K2.5 领先 1/1
评测项MiniMax M2.5Kimi K2.5分差
AIME202586.3048 / 106Thinking (No Tools)96.1021 / 106Thinking (No Tools)-9.80

Productivity Knowledge

Kimi K2.5 领先 1/1
评测项MiniMax M2.5Kimi K2.5分差
GDPval-AA3617 / 21Thinking (No Tools)4015 / 21Thinking (No Tools)-4

规格对比

字段MiniMax M2.5Kimi K2.5
发布机构MiniMaxAIMoonshot AI
发布时间2026-02-122026-01-27
模型类型推理大模型多模态大模型
架构MoE 架构MoE 架构
参数规模2290亿1万亿
上下文长度128K256K
最大输出暂无数据16K

API 调用价格

价格优先使用 DataLearner 配置的 API 记录;缺失项不做推测。

价格项MiniMax M2.5Kimi K2.5
文本输入$0.3 / 1M tokens暂无公开价格
文本输出$2.4 / 1M tokens暂无公开价格

部分模型公开价格不完整,缺失字段按"暂无公开价格"展示。

小结

  • MiniMax M2.5在以下类目领先:Claw-style Agent Evaluation (2/2)、Coding and Software Engineer (2/2)、AI Agent - Information Search (1/1)、AI Agent - Tool Usage (1/1)、Long Context (1/1)
  • Kimi K2.5在以下类目领先:General Knowledge (4/4)、Math and Reasoning (1/1)、Productivity Knowledge (1/1)

13 个共同 benchmark 上,Kimi K2.5 平均高出 0.99 分。

单项差距最大的 benchmark:HLE — MiniMax M2.5 19.40,Kimi K2.5 50.20(分差 -30.80)。

本页正文由结构化模型、价格与 benchmark 数据生成,不使用实时 LLM 撰写。