Qwen 3.6 Plus PreviewvsQwen3.5-397B-A17B

在 14 个共同 benchmark 中,Qwen 3.6 Plus Preview 整体领先:Qwen 3.6 Plus Preview 领先 12 项,Qwen3.5-397B-A17B 领先 2 项,持平 0 项,平均分差 +2.59。

阿里巴巴
Qwen 3.6 Plus Preview

阿里巴巴 · 2026-03-31 · 聊天大模型

阿里巴巴
Qwen3.5-397B-A17B

阿里巴巴 · 2026-02-16 · 多模态大模型

Qwen 3.6 Plus Preview12 (86%)(14%)2 Qwen3.5-397B-A17B

评测分数

按能力类目分组,每组内按分差大小排列;共 14 项。

Coding and Software Engineer

Qwen 3.6 Plus Preview 领先 4/4
评测项Qwen 3.6 Plus PreviewQwen3.5-397B-A17B分差
SWE-Bench Pro - Public56.6013 / 43Thinking (With Tools)50.9029 / 43Thinking (No Tools)+5.70
SWE-bench Multilingual73.807 / 20Thinking (No Tools)69.3017 / 20Thinking (No Tools)+4.50
LiveCodeBench87.1010 / 120Thinking (No Tools)83.6020 / 120Thinking (No Tools)+3.50
SWE-bench Verified78.8020 / 108Thinking (With Tools)76.4029 / 108Thinking (With Tools)+2.40

General Knowledge

Qwen 3.6 Plus Preview 领先 4/4
评测项Qwen 3.6 Plus PreviewQwen3.5-397B-A17B分差
HLE50.6017 / 157Thinking (With Tools)48.3028 / 157Thinking (With Tools + Internet)+2.30
GPQA Diamond90.4017 / 178Thinking (No Tools)88.4026 / 178Thinking (No Tools)+2
MMLU Pro88.505 / 126Thinking (No Tools)87.8010 / 126Thinking (No Tools)+0.70
C-Eval93.302 / 9Thinking (No Tools)933 / 9Thinking (No Tools)+0.30

AI Agent - Tool Usage

Qwen 3.6 Plus Preview 领先 2/2
评测项Qwen 3.6 Plus PreviewQwen3.5-397B-A17B分差
Terminal Bench 2.061.6016 / 46Thinking (With Tools)52.5029 / 46Thinking (With Tools)+9.10
Tool Decathlon39.804 / 7Thinking (With Tools)38.305 / 7Thinking (With Tools)+1.50

Math and Reasoning

Qwen 3.6 Plus Preview 领先 2/2
评测项Qwen 3.6 Plus PreviewQwen3.5-397B-A17B分差
AIME 202695.302 / 14Thinking (No Tools)91.3011 / 14Thinking (No Tools)+4
IMO-AnswerBench83.8010 / 19Thinking (No Tools)80.9015 / 19Thinking (No Tools)+2.90

Instruction Following

Qwen3.5-397B-A17B 领先 1/1
评测项Qwen 3.6 Plus PreviewQwen3.5-397B-A17B分差
IF Bench74.206 / 29Thinking (No Tools)76.503 / 29Thinking (No Tools)-2.30

Long Context

Qwen3.5-397B-A17B 领先 1/1
评测项Qwen 3.6 Plus PreviewQwen3.5-397B-A17B分差
AA-LCR68.306 / 13Thinking (No Tools)68.705 / 13Thinking (No Tools)-0.40

规格对比

字段Qwen 3.6 Plus PreviewQwen3.5-397B-A17B
发布机构阿里巴巴阿里巴巴
发布时间2026-03-312026-02-16
模型类型聊天大模型多模态大模型
架构稠密模型MoE 架构
参数规模暂无数据397亿
上下文长度1M256K
最大输出64K暂无数据

API 调用价格

价格优先使用 DataLearner 配置的 API 记录;缺失项不做推测。

价格项Qwen 3.6 Plus PreviewQwen3.5-397B-A17B
文本输入$0.5 / 1M tokens$0.5 / 1M tokens
文本输出$3 / 1M tokens$3 / 1M tokens
缓存读取$0.05 / 1M tokens$0.05 / 1M tokens
缓存写入$0.625 / 1M tokens$0.625 / 1M tokens

小结

  • Qwen 3.6 Plus Preview在以下类目领先:Coding and Software Engineer (4/4)、General Knowledge (4/4)、AI Agent - Tool Usage (2/2)、Math and Reasoning (2/2)
  • Qwen3.5-397B-A17B在以下类目领先:Instruction Following (1/1)、Long Context (1/1)

14 个共同 benchmark 上,Qwen 3.6 Plus Preview 平均高出 2.59 分。

单项差距最大的 benchmark:Terminal Bench 2.0 — Qwen 3.6 Plus Preview 61.60,Qwen3.5-397B-A17B 52.50(分差 +9.10)。

本页正文由结构化模型、价格与 benchmark 数据生成,不使用实时 LLM 撰写。