Kimi K2.7 Code Benchmark Details

Kimi K2.7 Code's current benchmark results are led by LiveBench (rank 30 of 115, score 71.89), TerminalBench 2.1 (rank 10 of 14, score 67.04), and DeepSWE (rank 7 of 9, score 31). This page also compares it with 3 competitor models and 3 predecessor or same-series models, including performance and pricing views when available.

Benchmark Results

General Knowledge

1 evaluation

Benchmark | Mode | Score | Rank / total
LiveBench | Standard Mode | 71.89 | 30 / 115

AI Agent - Tool Usage

1 evaluation

Benchmark | Mode | Score | Rank / total
TerminalBench 2.1 | Thinking Mode, Tools | 67.04 | 10 / 14

Coding and Software Engineering

1 evaluation

Benchmark | Mode | Score | Rank / total
DeepSWE | Standard Mode, Tools | 31 | 7 / 9
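
Because the three leaderboards differ in size (115, 14, and 9 entries), the raw ranks are easier to compare as a relative position. The following is a minimal sketch, not something the source page computes: `top_percent` is a hypothetical helper that expresses a rank as a percentage of the leaderboard size, so lower values mean a position closer to the top.

```python
def top_percent(rank: int, total: int) -> float:
    """Return rank as a percentage of leaderboard size (lower is better).

    Hypothetical helper for illustration; the source page only lists rank / total.
    """
    if total <= 0 or not 1 <= rank <= total:
        raise ValueError("rank must satisfy 1 <= rank <= total")
    return round(100 * rank / total, 1)


# Rank / total pairs copied from the tables above.
RESULTS = {
    "LiveBench": (30, 115),
    "TerminalBench 2.1": (10, 14),
    "DeepSWE": (7, 9),
}

if __name__ == "__main__":
    for benchmark, (rank, total) in RESULTS.items():
        print(f"{benchmark}: rank {rank} of {total}, top {top_percent(rank, total)}%")
```

Under this measure, the LiveBench result sits in roughly the top quarter of its leaderboard, while the TerminalBench 2.1 and DeepSWE results fall in the lower half of much smaller fields.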

Competitor Comparison

Benchmark scores for Kimi K2.7 Code compared against top models in its class

The chart shows each model’s highest score per benchmark within the current filter. Out-of-100 benchmarks use raw heights; out-of-range benchmarks are scaled within that benchmark while labels keep the original scores.
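
The scaling rule can be sketched as follows. This is an illustration under stated assumptions, not the page's actual charting code: it assumes an out-of-range benchmark is scaled so that the best score in that benchmark maps to a height of 100, and the function name `bar_heights`, the `out_of_100` flag, and the `model_a`/`model_b`/`model_c` values are all hypothetical.

```python
from typing import Dict, Optional


def bar_heights(scores: Dict[str, Optional[float]], out_of_100: bool) -> Dict[str, float]:
    """Compute bar heights for one benchmark; labels would keep the raw scores.

    Assumption: out-of-range benchmarks are scaled relative to the best score
    within the same benchmark. Models without a score (None) get no bar.
    """
    present = {model: s for model, s in scores.items() if s is not None}
    if out_of_100 or not present:
        # Out-of-100 benchmarks use the raw score as the height.
        return dict(present)
    top = max(present.values())
    if top <= 0:
        return {model: 0.0 for model in present}
    return {model: round(100.0 * s / top, 2) for model, s in present.items()}


if __name__ == "__main__":
    # LiveBench scores from this page (an out-of-100 benchmark).
    livebench = {"Kimi K2.7 Code": 71.89, "GLM-5.2": 76.24,
                 "MiniMax M3": 70.02, "Qwen3.7 Max": 74.29}
    print(bar_heights(livebench, out_of_100=True))

    # Hypothetical out-of-range scores, for illustration only.
    hypothetical = {"model_a": 1200.0, "model_b": 900.0, "model_c": None}
    print(bar_heights(hypothetical, out_of_100=False))
```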

3 benchmarks with comparable scores. Each model shows its best score; mode label is displayed below.

Benchmark | Category | Kimi K2.7 Code (current) | GLM-5.2 | MiniMax M3 | Qwen3.7 Max
LiveBench | Comprehensive Evaluation | 71.89 (Standard Mode) | 76.24 (Standard Mode) | 70.02 (Deep Thinking Mode) | 74.29 (Deep Thinking Mode)
TerminalBench 2.1 | AI Agent - Tool Usage | 67.04 (Thinking Enabled, Tools) | 81.00 (Thinking Level High, Tools) | -- | --
DeepSWE | Coding and Software Engineering | 31.00 (Standard Mode, Tools) | 44.00 (Deep Thinking Mode, Tools) | -- | --

Standard API Pricing: Kimi K2.7 Code vs. Peer Models

Shows standard text input and output pricing side by side for each model. If extended-context pricing exists, the chart keeps the base rate and explains the threshold below.

Source: DataLearnerAI. Standard text prices shown here use the default supplier.

These models use different currencies or billing units, so the page falls back to raw price values instead of a shared bar chart.

Model | Supplier | Standard input | Standard output | Base price applies to
Kimi K2.7 Code | Moonshot AI | $0.95 / 1M tokens | $4 / 1M tokens | --
GLM-5.2 | Zhipu AI (智谱AI) | $1.4 / 1M tokens | $4.4 / 1M tokens | --
MiniMax M3 | MiniMaxAI | ¥2.1 / 1M tokens | ¥8.4 / 1M tokens | --
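
To make the per-1M-token prices concrete, the sketch below estimates the cost of a single workload for each model. It is an illustration only: the `workload_cost` helper and the example token counts are assumptions, and amounts stay in each supplier's listed currency because the page gives no exchange rate.

```python
from typing import Dict, Tuple

# (currency, input price, output price) per 1M tokens, copied from the listing above.
PRICES: Dict[str, Tuple[str, float, float]] = {
    "Kimi K2.7 Code": ("USD", 0.95, 4.0),
    "GLM-5.2": ("USD", 1.4, 4.4),
    "MiniMax M3": ("CNY", 2.1, 8.4),
}


def workload_cost(model: str, input_tokens: int, output_tokens: int) -> Tuple[float, str]:
    """Return (amount, currency) for a workload at standard text pricing.

    Hypothetical helper; it ignores any extended-context pricing tiers.
    """
    currency, in_price, out_price = PRICES[model]
    amount = input_tokens / 1_000_000 * in_price + output_tokens / 1_000_000 * out_price
    return round(amount, 4), currency


if __name__ == "__main__":
    # Example workload (assumed): 2M input tokens and 0.5M output tokens.
    for name in PRICES:
        amount, currency = workload_cost(name, 2_000_000, 500_000)
        print(f"{name}: {amount} {currency}")
```

The USD and CNY totals are not directly comparable without a conversion rate, which is the same reason the page falls back to raw price values.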

Version History

How each version of the Kimi K2.7 Code series stacks up on benchmark tests

The chart shows each model’s highest score per benchmark within the current filter. Out-of-100 benchmarks use raw heights; out-of-range benchmarks are scaled within that benchmark while labels keep the original scores.

2 benchmarks with comparable scores. Each model shows its best score; mode label is displayed below.

Benchmark | Category | Kimi K2.7 Code (current) | Kimi K2.6 | Kimi K2.5
LiveBench | Comprehensive Evaluation | 71.89 (Standard Mode) | -- | 69.07 (Thinking Enabled)
TerminalBench 2.1 | AI Agent - Tool Usage | 67.04 (Thinking Enabled, Tools) | 53.56 (Thinking Enabled) | --
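
The version-to-version differences can be worked out from the scores in this section. The sketch below is illustrative; `deltas_vs_current` is a hypothetical helper, and the comparison is only rough because the listed scores come from different modes (for example, Standard Mode against Thinking Enabled, or with and without tools).

```python
from typing import Dict, Optional, Tuple

# Best scores per model from the version history above; None marks "--".
SCORES: Dict[str, Dict[str, Optional[float]]] = {
    "LiveBench": {"Kimi K2.7 Code": 71.89, "Kimi K2.6": None, "Kimi K2.5": 69.07},
    "TerminalBench 2.1": {"Kimi K2.7 Code": 67.04, "Kimi K2.6": 53.56, "Kimi K2.5": None},
}


def deltas_vs_current(scores: Dict[str, Dict[str, Optional[float]]],
                      current: str) -> Dict[Tuple[str, str], float]:
    """Return current-minus-older score differences where both scores exist."""
    out: Dict[Tuple[str, str], float] = {}
    for benchmark, by_model in scores.items():
        cur = by_model.get(current)
        if cur is None:
            continue
        for model, score in by_model.items():
            if model != current and score is not None:
                out[(benchmark, model)] = round(cur - score, 2)
    return out


if __name__ == "__main__":
    for (benchmark, model), delta in deltas_vs_current(SCORES, "Kimi K2.7 Code").items():
        print(f"{benchmark}: {delta:+.2f} points vs {model}")
```

With the scores listed here, this gives a gain of 2.82 points over Kimi K2.5 on LiveBench and 13.48 points over Kimi K2.6 on TerminalBench 2.1.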

Single-Benchmark Version Trend

Viewing: LiveBench (Comprehensive Evaluation)

Modes: Normal, Normal + Tools, Thinking, Thinking + Tools, Deep, Deep + Tools

The x-axis shows model and release date, and the y-axis shows score. Solid lines connect the same mode across versions, while dotted guides align modes within the same generation.

Standard API Pricing Across the Kimi K2.7 Code Series

Shows standard text input and output pricing side by side for each model. If extended-context pricing exists, the chart keeps the base rate and explains the threshold below.

Source: DataLearnerAI. Standard text prices shown here use the default supplier. Prices are in USD per 1M tokens.

Model | Supplier | Standard input | Standard output | Base price applies to
Kimi K2.7 Code | Moonshot AI | $0.95 / 1M tokens | $4 / 1M tokens | --
Kimi K2.6 | Moonshot AI | $0.95 / 1M tokens | $4 / 1M tokens | --