Kimi K2 Thinking

Name: Kimi K2 Thinking
Availability: InStock
Author: Moonshot AI

推理大模型

Kimi K2 Thinking

Release date: 2025-11-06更新于: 2025-11-07 09:40:591,746

Live demo GitHub Hugging Face Compare

Parameters

10400.0亿

Context length

256K

Chinese support

Supported

Reasoning ability

Kimi K2 Thinking is an AI model published by Moonshot AI, released on 2025-11-06, for 推理大模型, with 10400.0B parameters, and 256K tokens context length, requiring about 1.09 TB storage, under the Modified MIT License license.

Data sourced primarily from official releases (GitHub, Hugging Face, papers), then benchmark leaderboards, then third-party evaluators. Learn about our data methodology

Kimi K2 Thinking

Model basics

Reasoning traces

Supported

Thinking modes

Thinking modes not supported

Context length

256K tokens

Max output length

No data

Model type

Kimi K2 Thinking

Open source & experience

Code license

Modified MIT License

Weights license

Modified MIT License- 免费商用授权

GitHub repo

https://github.com/MoonshotAI/Kimi-K2

Hugging Face

https://huggingface.co/moonshotai/Kimi-K2-Thinking

Kimi K2 Thinking

Official resources

Paper

Introducing Kimi K2 Thinking

DataLearnerAI blog

Moonshot AI 发布 Kimi K2 Thinking：连续执行200-300次顺序工具调用，人类最后难题评测得分超过所有模型，全球第一！依然免费开源商用！

Kimi K2 Thinking

API details

API speed

3/5

Kimi K2 Thinking

Benchmark Results

Kimi K2 Thinking currently shows benchmark results led by AIME2025 (1 / 107, score 100), HLE (9 / 128, score 51), Terminal-Bench (4 / 35, score 47.10). This page also consolidates core specs, context limits, and API pricing so you can evaluate the model from benchmark results and deployment constraints together.

综合评估

4 evaluations

Benchmark / mode

Score

Rank/total

MMLU Pro

Medium

84.60

24 / 116

GPQA Diamond

Medium

84.50

41 / 166

LiveBench

Medium

67.93

26 / 51

HLE

Medium

23.90

67 / 128

编程与软件工程

1 evaluations

Benchmark / mode

Score

Rank/total

LiveCodeBench

Medium

83.10

14 / 109

数学推理

1 evaluations

Benchmark / mode

Score

Rank/total

AIME2025

Medium

94.50

27 / 107

指令跟随

1 evaluations

Benchmark / mode

Score

Rank/total

IF Bench

Medium

15 / 27

View benchmark analysis Compare with other models

Kimi K2 Thinking

Publisher

Moonshot AI

View publisher details

Kimi K2 Thinking

Model Overview

Moonshot AI 于 2025 年 11 月 6 日发布了 Kimi K2 Thinking 模型。这是 Kimi K2 系列的第一个推理变体。该公司位于北京，由阿里巴巴支持。Kimi K2 系列此前在 2025 年 7 月和 9 月发布了非推理版本的 Kimi K2 Instruct 模型。

模型规格

Kimi K2 Thinking 采用混合专家（MoE）架构，总参数量为 1 万亿，活跃参数为 320 亿。它使用 INT4 精度，这比先前版本的 FP8 精度更节省空间，模型文件大小约为 594 GB。上下文窗口支持 256K 令牌。输入和输出仅限于文本模态。

Moonshot AI 在后训练阶段使用了量化感知训练，以实现 INT4 精度。这使得模型在较旧的 NVIDIA GPU 上运行时更高效，因为这些 GPU 不支持 FP4。

功能和能力

该模型设计为思考代理，支持多步推理和工具使用。它可以执行 200 到 300 个连续工具调用，而无需人类干预。主要应用包括推理、代理搜索和编码任务。在测试时，它通过扩展思考令牌和工具调用轮次来处理复杂任务。

在代理任务基准中，Kimi K2 Thinking 在 Tau2 Bench Telecom 上得分 93%，模拟客户服务代理场景。在 HLE 上得分 44.9%，在 BrowseComp 上得分 60.2%。这些分数使其在开源模型中位居前列。

DataLearner 官方微信

欢迎关注 DataLearner 官方微信，获得最新 AI 技术推送