DataLearner logoDataLearnerAI
AI Tech Blogs
Leaderboards
Benchmarks
Models
Resources
Tool Directory

加载中...

DataLearner logoDataLearner AI

A knowledge platform focused on LLM benchmarking, datasets, and practical instruction with continuously updated capability maps.

产品

  • Leaderboards
  • 模型对比
  • Datasets

资源

  • Tutorials
  • Editorial
  • Tool directory

关于

  • 关于我们
  • 隐私政策
  • 数据收集方法
  • 联系我们

© 2026 DataLearner AI. DataLearner curates industry data and case studies so researchers, enterprises, and developers can rely on trustworthy intelligence.

隐私政策服务条款
Loading comparison...
Table of Contents
目录
  1. Home
  2. Model Compare
  3. Results

大模型评测对比结果

See key specs and per-benchmark scores for each model/mode. Scroll horizontally for all columns. 当前对比 4 个模型的评测数据与核心参数。

Qwen3-Coder-NextDeepSeek V3.2GLM-4.7M2.1
规格对比
阿里巴巴

Qwen3-Coder-Next

QW

Qwen3-Coder-Next

Release2026-02-03
Context length256K
Parameters80
常规模式(Non-Thinking Mode)
Model profile
DeepSeek-AI

DeepSeek V3.2

DE

DeepSeek V3.2 (正式版)

Release2025-12-01
Context length128K
Parameters6710
常规模式(Non-Thinking Mode)思考模式(Thinking Mode)
Model profilePlayground
智谱AI

GLM-4.7

GL

GLM-4.7

Release2025-12-22
Context length200K
Parameters3580
常规模式(Non-Thinking Mode)思考模式(Thinking Mode)
Model profilePlayground
MiniMaxAI

M2.1

M2

MiniMax M2.1 Preview

Release2025-12-23
Context length200K
Parameters2300
常规模式(Non-Thinking Mode)思考模式(Thinking Mode)
Model profilePlayground

Performance benchmarks

Compare benchmark results across thinking modes and tool usage.

Data sourced primarily from official releases (GitHub, Hugging Face, papers), then benchmark leaderboards, then third-party evaluators. Learn about our data methodology

Performance benchmarks

Compare benchmark results across thinking modes and tool usage.

All Modes
Shortcuts
Thinking Mode (Default)
Thinking Mode (Default) - Help
  • Default: Thinking Mode (Default) (Standard/Medium)
  • All: Thinking Mode (All)
All Tools & Parallel

Best Overall

DeepSeek V3.2 · 57.58

Best Single

M2.1 · SWE-bench Verified 74.80

Thinking Mode (Default)

DeepSeek V3.2 · 2 All Modes

Benchmark scores

Higher is usually better; “—” means no score.

Filter: All Modes6 All Modes · 4 Benchmark
图表加载中...

Benchmark score table

Complete scores for each model/mode across selected benchmarks.

Benchmark scores

Higher is usually better; “—” means no score.

4 Benchmark6 All Modes
Supported modes:NormalThinkDeepToolParallel
Benchmark
QW
Qwen3-Coder-Next阿里巴巴
DE
DeepSeek V3.2DeepSeek-AI
GL
GLM-4.7智谱AI
M2
M2.1MiniMaxAI
编程与软件工程
SWE-Bench Pro - Public
44.3040.90—40.60—32.60
SWE-bench Verified
70.6070.2073.1073.8074.80—
Agent能力评测
Aider-Polyglot
66.20—69.9052.10—61.00
AI Agent - 工具使用
Terminal Bench 2.0
36.20—46.4041.00—47.90

Feature compare

Detailed feature breakdown

Licensing, MoE architecture, and multi-modality support.

Features & specs
QW
Qwen3-Coder-Next阿里巴巴
DE
DeepSeek V3.2DeepSeek-AI
GL
GLM-4.7智谱AI
M2
M2.1MiniMaxAI

Model snapshots

Organization
阿里巴巴DeepSeek-AI智谱AIMiniMaxAI
模型全名
Qwen3-Coder-NextDeepSeek V3.2 (正式版)GLM-4.7MiniMax M2.1 Preview
模型简介
Not providedNot providedNot providedNot provided
模型类型
编程大模型推理大模型聊天大模型聊天大模型
模型代号
qwen3-coder-nextdeepseek-v3-2glm-4-7minimax-m2-1-preview
Release
2026-02-032025-12-012025-12-222025-12-23
MoE
YesYesYesYes

规格与性能

Context length
256K128K200K200K
Parameters
80671035802300
激活参数量
3370320100
模型规模
7b100b100b100b
模型大小
48GB1.34TBNot providedNot provided
推理速度
推理等级
最大输出
655368192132072131072
Supported modes
常规模式(Non-Thinking Mode)
常规模式(Non-Thinking Mode)思考模式(Thinking Mode)
常规模式(Non-Thinking Mode)思考模式(Thinking Mode)
常规模式(Non-Thinking Mode)思考模式(Thinking Mode)

开源与许可

Code Open Source
Not providedNot providedClosed SourceClosed Source
Weights Open Source
Not providedNot providedClosed SourceNot provided
Commercial use
免费商用授权免费商用授权免费商用授权免费商用授权

Modality support

Text Input/Output
/
/
/
/
Image Input/Output
/
/
/
/
Audio Input/Output
/
/
/
/
Video Input/Output
/
/
/
/
Embedding Input/Output
/
/
/
/

API 接口详情

Text 价格
Not provided
Input: 0.28 美元/100万 tokensOutput: 0.42 美元/100万 tokensCache: 0.028 美元/100万 tokens
Input: 0.6 美元/100万 tokensOutput: 2.2 美元/100万 tokensCache: 0.11 美元/100万 tokens
Input: 0.3 美元/100 万tokensOutput: 1.2 美元/100 万tokensCache: 0.03 美元/100 万tokens
Image API pricing
Not providedNot providedNot providedNot provided
Audio API pricing
Not providedNot providedNot providedNot provided
Video API pricing
Not providedNot providedNot providedNot provided
Embedding API pricing
Not providedNot providedNot providedNot provided

Resources

GitHub
RepoRepoRepoRepo
Hugging Face
Model PageModel PageModel PageModel Page
Official Page
Not providedNot providedNot providedNot provided
Guides
Not providedNot providedNot providedNot provided
Papers
Qwen3-Coder-Next: Pushing Small Hybrid Models on Agentic CodingDeepSeek-V3.2 正式版发布与说明GLM-4.7: Advancing the Coding CapabilityMiniMax M2.1: Significantly Enhanced Multi-Language Programming, Built for Real-World Complex Tasks
DataLearnerAI
Not provided复杂问题推理能力大幅提升,DeepSeekAI发布DeepSeek V3.2正式版本以及一个评测结果可以媲美Gemini 3.0 Pro的将开源模型推到极限性能的DeepSeek-V3.2-Speciale模型Not providedNot provided

API pricing

API price comparison

Side-by-side input/output token pricing

Higher is usually better; “—” means no score.