MiniMax M2.5vsMiniMax M2

Across 7 shared benchmarks, MiniMax M2.5 leads overall: MiniMax M2.5 wins 6, MiniMax M2 wins 1, with 0 ties and an average score difference of +10.57.

MiniMax M2.5

MiniMaxAI · 2026-02-12 · Reasoning model

MiniMax M2

MiniMaxAI · 2025-10-27 · Chat model

MiniMax M2.56 wins(86%)(14%)1 winMiniMax M2

Benchmark scores

Grouped by capability, sorted by largest gap within each. 7 shared benchmarks.

General Knowledge

MiniMax M2.5 2/2

Benchmark	MiniMax M2.5	MiniMax M2	Diff
GPQA Diamond	85.2053 / 187Thinking (No Tools)	7889 / 187	+7.20
HLE	19.40121 / 172Thinking (No Tools)	12.50140 / 172	+6.90

Agent Level Benchmark

MiniMax M2.5 1/1

Benchmark	MiniMax M2.5	MiniMax M2	Diff
τ²-Bench - Telecom	97.8010 / 35	8722 / 35	+10.80

AI Agent - Information Search

MiniMax M2.5 1/1

Benchmark	MiniMax M2.5	MiniMax M2	Diff
BrowseComp	76.3023 / 53	4446 / 53	+32.30

Coding and Software Engineer

MiniMax M2.5 1/1

Benchmark	MiniMax M2.5	MiniMax M2	Diff
SWE-bench Verified	80.2014 / 112	69.4062 / 112	+10.80

Instruction Following

MiniMax M2 1/1

Benchmark	MiniMax M2.5	MiniMax M2	Diff
IF Bench	7013 / 30	72.3010 / 30	-2.30

Math and Reasoning

MiniMax M2.5 1/1

Benchmark	MiniMax M2.5	MiniMax M2	Diff
AIME2025	86.3049 / 107Thinking (No Tools)	7861 / 107	+8.30

Specs

Field	MiniMax M2.5	MiniMax M2
Publisher	MiniMaxAI	MiniMaxAI
Release date	2026-02-12	2025-10-27
Model type	Reasoning model	Chat model
Architecture	MoE	MoE
Parameters	229B	230B
Context length	128K	205K
Max output	Not available	Not available

API pricing

Prices use DataLearner records when available; missing fields are not inferred.

Item	MiniMax M2.5	MiniMax M2
Text input	$0.3 / 1M tokens	¥2.1 / 1M tokens
Text output	$2.4 / 1M tokens	¥8.4 / 1M tokens
Cache read	Not public	¥0.21 / 1M tokens
Cache write	Not public	¥2.625 / 1M tokens

Summary

MiniMax M2.5leads in:General Knowledge (2/2), Agent Level Benchmark (1/1), AI Agent - Information Search (1/1), Coding and Software Engineer (1/1), Math and Reasoning (1/1)
MiniMax M2leads in:Instruction Following (1/1)

On average across the 7 shared benchmarks, MiniMax M2.5 scores 10.57 higher.

Largest single-benchmark gap: BrowseComp — MiniMax M2.5 76.30 vs MiniMax M2 44 (+32.30).

Page generated from structured model, pricing and benchmark records. No real-time LLM is used to write the prose.

MiniMax M2.5 details MiniMax M2 details·Customize in compare tool