- MiniMax M2.5leads in:Coding and Software Engineer (2/2), Agent Level Benchmark (1/1), AI Agent - Information Search (1/1), AI Agent - Tool Usage (1/1), Claw-style Agent Evaluation (1/1), Math and Reasoning (1/1)
- Tied in:General Knowledge, Instruction Following
On average across the 10 shared benchmarks, MiniMax M2.5 scores 8.21 higher.
Largest single-benchmark gap: BrowseComp — MiniMax M2.5 76.30 vs M2.1 47.40 (+28.90).
Page generated from structured model, pricing and benchmark records. No real-time LLM is used to write the prose.