加载中...
加载中...
See key specs and per-benchmark scores for each model/mode. Scroll horizontally for all columns. 当前对比 4 个模型的评测数据与核心参数。
Compare benchmark results across thinking modes and tool usage.
Data sourced primarily from official releases (GitHub, Hugging Face, papers), then benchmark leaderboards, then third-party evaluators. Learn about our data methodology
Performance benchmarks
Compare benchmark results across thinking modes and tool usage.
Best Overall
DeepSeek V3.2 · 57.58
Best Single
M2.1 · SWE-bench Verified 74.80
Thinking Mode (Default)
DeepSeek V3.2 · 2 All Modes
Higher is usually better; “—” means no score.
Complete scores for each model/mode across selected benchmarks.
Higher is usually better; “—” means no score.
| Benchmark | QW Qwen3-Coder-Next阿里巴巴 | DE DeepSeek V3.2DeepSeek-AI | GL GLM-4.7智谱AI | M2 M2.1MiniMaxAI | ||
|---|---|---|---|---|---|---|
| 编程与软件工程 | ||||||
SWE-Bench Pro - Public | 44.30 | 40.90 | — | 40.60 | — | 32.60 |
SWE-bench Verified | 70.60 | 70.20 | 73.10 | 73.80 | 74.80 | — |
| Agent能力评测 | ||||||
Aider-Polyglot | 66.20 | — | 69.90 | 52.10 | — | 61.00 |
| AI Agent - 工具使用 | ||||||
Terminal Bench 2.0 | 36.20 | — | 46.40 | 41.00 | — | 47.90 |
Feature compare
Licensing, MoE architecture, and multi-modality support.
| Features & specs | QW Qwen3-Coder-Next阿里巴巴 | DE DeepSeek V3.2DeepSeek-AI | GL GLM-4.7智谱AI | M2 M2.1MiniMaxAI |
|---|---|---|---|---|
Model snapshots | ||||
Organization | 阿里巴巴 | DeepSeek-AI | 智谱AI | MiniMaxAI |
模型全名 | Qwen3-Coder-Next | DeepSeek V3.2 (正式版) | GLM-4.7 | MiniMax M2.1 Preview |
模型简介 | Not provided | Not provided | Not provided | Not provided |
模型类型 | 编程大模型 | 推理大模型 | 聊天大模型 | 聊天大模型 |
模型代号 | qwen3-coder-next | deepseek-v3-2 | glm-4-7 | minimax-m2-1-preview |
Release | 2026-02-03 | 2025-12-01 | 2025-12-22 | 2025-12-23 |
MoE | Yes | Yes | Yes | Yes |
规格与性能 | ||||
Context length | 256K | 128K | 200K | 200K |
Parameters | 80 | 6710 | 3580 | 2300 |
激活参数量 | 3 | 370 | 320 | 100 |
模型规模 | 7b | 100b | 100b | 100b |
模型大小 | 48GB | 1.34TB | Not provided | Not provided |
推理速度 | ||||
推理等级 | ||||
最大输出 | 65536 | 8192 | 132072 | 131072 |
Supported modes | 常规模式(Non-Thinking Mode) | 常规模式(Non-Thinking Mode)思考模式(Thinking Mode) | 常规模式(Non-Thinking Mode)思考模式(Thinking Mode) | 常规模式(Non-Thinking Mode)思考模式(Thinking Mode) |
开源与许可 | ||||
Code Open Source | Not provided | Not provided | Closed Source | Closed Source |
Weights Open Source | Not provided | Not provided | Closed Source | Not provided |
Commercial use | 免费商用授权 | 免费商用授权 | 免费商用授权 | 免费商用授权 |
Modality support | ||||
Text Input/Output | / | / | / | / |
Image Input/Output | / | / | / | / |
Audio Input/Output | / | / | / | / |
Video Input/Output | / | / | / | / |
Embedding Input/Output | / | / | / | / |
API 接口详情 | ||||
Text 价格 | Not provided | Input: 0.28 美元/100万 tokensOutput: 0.42 美元/100万 tokensCache: 0.028 美元/100万 tokens | Input: 0.6 美元/100万 tokensOutput: 2.2 美元/100万 tokensCache: 0.11 美元/100万 tokens | Input: 0.3 美元/100 万tokensOutput: 1.2 美元/100 万tokensCache: 0.03 美元/100 万tokens |
Image API pricing | Not provided | Not provided | Not provided | Not provided |
Audio API pricing | Not provided | Not provided | Not provided | Not provided |
Video API pricing | Not provided | Not provided | Not provided | Not provided |
Embedding API pricing | Not provided | Not provided | Not provided | Not provided |
Resources | ||||
GitHub | Repo | Repo | Repo | Repo |
Hugging Face | Model Page | Model Page | Model Page | Model Page |
Official Page | Not provided | Not provided | Not provided | Not provided |
Guides | Not provided | Not provided | Not provided | Not provided |
Papers | Qwen3-Coder-Next: Pushing Small Hybrid Models on Agentic Coding | DeepSeek-V3.2 正式版发布与说明 | GLM-4.7: Advancing the Coding Capability | MiniMax M2.1: Significantly Enhanced Multi-Language Programming, Built for Real-World Complex Tasks |
DataLearnerAI | Not provided | 复杂问题推理能力大幅提升,DeepSeekAI发布DeepSeek V3.2正式版本以及一个评测结果可以媲美Gemini 3.0 Pro的将开源模型推到极限性能的DeepSeek-V3.2-Speciale模型 | Not provided | Not provided |
API pricing
Side-by-side input/output token pricing