MiniMax-M2.7vsMiniMax M2.5

Across 10 shared benchmarks, MiniMax-M2.7 leads overall: MiniMax-M2.7 wins 6, MiniMax M2.5 wins 4, with 0 ties and an average score difference of +2.01.

MiniMax-M2.7

MiniMaxAI · 2026-03-18 · Reasoning model

MiniMax M2.5

MiniMaxAI · 2026-02-12 · Reasoning model

MiniMax-M2.76 wins(60%)(40%)4 winsMiniMax M2.5

Benchmark scores

Grouped by capability, sorted by largest gap within each. 10 shared benchmarks.

General Knowledge

MiniMax-M2.7 3/3

Benchmark	MiniMax-M2.7	MiniMax M2.5	Diff
HLE	2896 / 172Thinking (No Tools)	19.40121 / 172Thinking (No Tools)	+8.60
LiveBench	63.4956 / 115Deep Thinking (No Tools)	60.1468 / 115Deep Thinking (No Tools)	+3.35
GPQA Diamond	8742 / 187Thinking (No Tools)	85.2053 / 187Thinking (No Tools)	+1.80

Claw-style Agent Evaluation

MiniMax M2.5 2/2

Benchmark	MiniMax-M2.7	MiniMax M2.5	Diff
Pinch Bench	87.109 / 37Thinking (With Tools)	87.806 / 37Thinking (With Tools)	-0.70
Claw Bench	91.705 / 29Thinking (With Tools)	92.104 / 29Thinking (With Tools)	-0.40

Agent Level Benchmark

MiniMax M2.5 1/1

Benchmark	MiniMax-M2.7	MiniMax M2.5	Diff
τ²-Bench - Telecom	8524 / 35Thinking (With Tools)	97.8010 / 35	-12.80

Coding and Software Engineer

MiniMax-M2.7 1/1

Benchmark	MiniMax-M2.7	MiniMax M2.5	Diff
SWE-Bench Pro - Public	56.2024 / 54Thinking (With Tools)	55.4026 / 54	+0.80

Instruction Following

MiniMax-M2.7 1/1

Benchmark	MiniMax-M2.7	MiniMax M2.5	Diff
IF Bench	766 / 30Thinking (With Tools)	7013 / 30	+6

Long Context

MiniMax M2.5 1/1

Benchmark	MiniMax-M2.7	MiniMax M2.5	Diff
AA-LCR	696 / 15Thinking (With Tools)	69.505 / 15Thinking (No Tools)	-0.50

Productivity Knowledge

MiniMax-M2.7 1/1

Benchmark	MiniMax-M2.7	MiniMax M2.5	Diff
GDPval-AA	5013 / 21Thinking (No Tools)	3617 / 21Thinking (No Tools)	+14

Specs

Field	MiniMax-M2.7	MiniMax M2.5
Publisher	MiniMaxAI	MiniMaxAI
Release date	2026-03-18	2026-02-12
Model type	Reasoning model	Reasoning model
Architecture	MoE	MoE
Parameters	229B	229B
Context length	200K	128K
Max output	200K	Not available

API pricing

Prices use DataLearner records when available; missing fields are not inferred.

Item	MiniMax-M2.7	MiniMax M2.5
Text input	$0.3 / 1M tokens	$0.3 / 1M tokens
Text output	$1.2 / 1M tokens	$2.4 / 1M tokens
Cache read	$0.06 / 1M tokens	Not public
Cache write	$0.375 / 1M tokens	Not public

Summary

MiniMax-M2.7leads in:General Knowledge (3/3), Coding and Software Engineer (1/1), Instruction Following (1/1), Productivity Knowledge (1/1)
MiniMax M2.5leads in:Claw-style Agent Evaluation (2/2), Agent Level Benchmark (1/1), Long Context (1/1)

On average across the 10 shared benchmarks, MiniMax-M2.7 scores 2.01 higher.

Largest single-benchmark gap: GDPval-AA — MiniMax-M2.7 50 vs MiniMax M2.5 36 (+14).

Page generated from structured model, pricing and benchmark records. No real-time LLM is used to write the prose.

MiniMax-M2.7 details MiniMax M2.5 details·Customize in compare tool