Kimi K2.7 Code Benchmark Details

Kimi K2.7 Code currently shows benchmark results led by LiveBench (30 / 115, score 71.89), TerminalBench 2.1 (10 / 14, score 67.04), DeepSWE (7 / 9, score 31). This page also compares it with 3 competitor models and 3 predecessor or same-series models, including performance and pricing views when available.

Benchmark Results

Kimi K2.7 Code

Benchmark Results

General Knowledge

1 evaluations

Benchmark / mode

Score

Rank/total

LiveBench

Standard Mode

71.89

30 / 115

AI Agent - Tool Usage

1 evaluations

Benchmark / mode

Score

Rank/total

TerminalBench 2.1

Thinking ModeTools

67.04

10 / 14

Coding and Software Engineer

1 evaluations

Benchmark / mode

Score

Rank/total

DeepSWE

Standard ModeTools

7 / 9

Compare with other models

Competitor Comparison

Benchmark scores for Kimi K2.7 Code compared against top models in its class

Kimi K2.7 CodeGLM-5.2 MiniMax M3 Qwen3.7 Max

Benchmark categories:

The chart shows each model’s highest score per benchmark within the current filter. Out-of-100 benchmarks use raw heights; out-of-range benchmarks are scaled within that benchmark while labels keep the original scores.

3 benchmarks with comparable scores. Each model shows its best score; mode label is displayed below.

Benchmark	Kimi K2.7 CodeCurrent	GLM-5.2	MiniMax M3	Qwen3.7 Max
LiveBench 综合评估	71.89Standard Mode	76.24Standard Mode	70.02Deep Thinking Mode	74.29Deep Thinking Mode
TerminalBench 2.1 AI Agent - 工具使用	67.04Thinking Enabled ｜ Tools	81.00Thinking Level · High ｜ Tools	--	--
DeepSWE 编程与软件工程	31.00Standard Mode ｜ Tools	44.00Deep Thinking Mode ｜ Tools	--	--

Standard API Pricing: Kimi K2.7 Code vs. Peer Models

Shows standard text input and output pricing side by side for each model. If extended-context pricing exists, the chart keeps the base rate and explains the threshold below.

Source: DataLearnerAI. Standard text prices shown here use the default supplier.

These models use different currencies or billing units, so the page falls back to raw price values instead of a shared bar chart.

Kimi K2.7 Code

Supplier: Moonshot AI

Standard input: $0.95 / 1M tokens

Standard output: $4 / 1M tokens

GLM-5.2

Supplier: 智谱AI

Standard input: $1.4 / 1M tokens

Standard output: $4.4 / 1M tokens

MiniMax M3

Supplier: MiniMaxAI

Standard input: ¥2.1 / 1M tokens

Standard output: ¥8.4 / 1M tokens

Model	Supplier	Standard input	Standard output	Base price applies to
Kimi K2.7 Code	Moonshot AI	$0.95 / 1M tokens	$4 / 1M tokens	—
GLM-5.2	智谱AI	$1.4 / 1M tokens	$4.4 / 1M tokens	—
MiniMax M3	MiniMaxAI	¥2.1 / 1M tokens	¥8.4 / 1M tokens	—

Version History

How each version of the Kimi K2.7 Code series stacks up on benchmark tests

Kimi K2.7 CodeKimi K2.6 Kimi K2.5 Kimi K2 Thinking

Benchmark categories:

2 benchmarks with comparable scores. Each model shows its best score; mode label is displayed below.· Click a row to view its trend chart.

Benchmark	Kimi K2.7 CodeCurrent	Kimi K2.6	Kimi K2.5
LiveBench 综合评估	71.89Standard Mode	--	69.07Thinking Enabled
TerminalBench 2.1 AI Agent - 工具使用	67.04Thinking Enabled ｜ Tools	53.56Thinking Enabled	--

Single-Benchmark Version Trend

Viewing: LiveBench · 综合评估

Benchmark

NormalNormal + ToolsThinkingThinking + ToolsDeepDeep + Tools

X-axis shows model and release date, Y-axis shows score; solid lines connect the same mode across versions, while dotted guides align modes within the same generation.

Standard API Pricing Across the Kimi K2.7 Code Series

Shows standard text input and output pricing side by side for each model. If extended-context pricing exists, the chart keeps the base rate and explains the threshold below.

Source: DataLearnerAI. Standard text prices shown here use the default supplier. · USD / 1M tokens

Model	Supplier	Standard input	Standard output	Base price applies to
Kimi K2.7 Code	Moonshot AI	$0.95 / 1M tokens	$4 / 1M tokens	—
Kimi K2.6	Facebook AI研究实验室	$0.95 / 1M tokens	$4 / 1M tokens	—