Qwen3.6-27B vs Qwen3-32B

Across 3 shared benchmarks, Qwen3.6-27B leads overall: Qwen3.6-27B wins 3, Qwen3-32B wins 0, with 0 ties and an average score difference of +31.30.

Qwen3.6-27B: Alibaba · 2026-04-22 · Reasoning model

Qwen3-32B: Alibaba · 2025-04-28 · Reasoning model

Qwen3.6-27B: 3 wins (100%) · Qwen3-32B: 0 wins (0%)

Benchmark scores

Grouped by capability, sorted by largest gap within each. 3 shared benchmarks.

Qwen3.6-27B leads 2 of 2

Benchmark	Qwen3.6-27B	Qwen3-32B	Diff
GPQA Diamond	87.80 · 33 / 178 · Thinking (No Tools)	54.60 · 148 / 178 · Normal (No Tools)	+33.20
C-Eval	91.40 · 5 / 9 · Thinking (No Tools)	83.30 · 9 / 9 · Normal (No Tools)	+8.10

Qwen3.6-27B leads 1 of 1

Benchmark	Qwen3.6-27B	Qwen3-32B	Diff
LiveCodeBench	83.90 · 19 / 120 · Thinking (No Tools)	31.30 · 114 / 120 · Normal (No Tools)	+52.60

On average across the 3 shared benchmarks, Qwen3.6-27B scores 31.30 points higher than Qwen3-32B.

Largest single-benchmark gap: LiveCodeBench — Qwen3.6-27B 83.90 vs Qwen3-32B 31.30 (+52.60).
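
The summary figures can be reproduced from the three table rows. The following is a minimal Python sketch, assuming the average is a simple unweighted mean of the per-benchmark differences. The function name `summarize` and the data layout are illustrative only and are not taken from the site's own pipeline.

```python
# Scores copied from the benchmark tables: (benchmark, Qwen3.6-27B, Qwen3-32B).
SCORES = [
    ("GPQA Diamond", 87.80, 54.60),
    ("C-Eval", 91.40, 83.30),
    ("LiveCodeBench", 83.90, 31.30),
]


def summarize(rows):
    """Return win counts, mean difference and largest gap (hypothetical helper)."""
    # Per-benchmark difference, first model minus second, rounded to 2 decimals.
    diffs = {name: round(a - b, 2) for name, a, b in rows}
    # Assumption: the mean is unweighted across benchmarks.
    mean_diff = round(sum(diffs.values()) / len(diffs), 2)
    # Benchmark with the largest absolute difference.
    largest = max(diffs, key=lambda name: abs(diffs[name]))
    return {
        "diffs": diffs,
        "wins_first": sum(d > 0 for d in diffs.values()),
        "wins_second": sum(d < 0 for d in diffs.values()),
        "ties": sum(d == 0 for d in diffs.values()),
        "mean_diff": mean_diff,
        "largest_gap": (largest, diffs[largest]),
    }


summary = summarize(SCORES)
print(summary)
```

Under that assumption, the computed win counts, mean difference and largest gap can be checked directly against the figures stated on this page.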

Page generated from structured model, pricing and benchmark records. No real-time LLM is used to write the prose.