Qwen3.6-27BvsQwen3-32B

Across 3 shared benchmarks, Qwen3.6-27B leads overall: Qwen3.6-27B wins 3, Qwen3-32B wins 0, with 0 ties and an average score difference of +31.30.

阿里巴巴
Qwen3.6-27B

阿里巴巴 · 2026-04-22 · Reasoning model

阿里巴巴
Qwen3-32B

阿里巴巴 · 2025-04-28 · Reasoning model

Qwen3.6-27B3 wins(100%)(0%)0 winsQwen3-32B

Benchmark scores

Grouped by capability, sorted by largest gap within each. 3 shared benchmarks.

General Knowledge

Qwen3.6-27B 2/2
BenchmarkQwen3.6-27BQwen3-32BDiff
GPQA Diamond87.8033 / 178Thinking (No Tools)54.60148 / 178Normal (No Tools)+33.20
C-Eval91.405 / 9Thinking (No Tools)83.309 / 9Normal (No Tools)+8.10

Coding and Software Engineer

Qwen3.6-27B 1/1
BenchmarkQwen3.6-27BQwen3-32BDiff
LiveCodeBench83.9019 / 120Thinking (No Tools)31.30114 / 120Normal (No Tools)+52.60

Specs

FieldQwen3.6-27BQwen3-32B
Publisher阿里巴巴阿里巴巴
Release date2026-04-222025-04-28
Model typeReasoning modelReasoning model
ArchitectureDenseDense
Parameters27B32B
Context length128K128K
Max output16K16K

Summary

  • Qwen3.6-27Bleads in:General Knowledge (2/2), Coding and Software Engineer (1/1)

On average across the 3 shared benchmarks, Qwen3.6-27B scores 31.30 higher.

Largest single-benchmark gap: LiveCodeBench — Qwen3.6-27B 83.90 vs Qwen3-32B 31.30 (+52.60).

Page generated from structured model, pricing and benchmark records. No real-time LLM is used to write the prose.