Qwen3.6-27B vs Qwen3-32B
Across 3 shared benchmarks, Qwen3.6-27B leads overall: Qwen3.6-27B wins 3, Qwen3-32B wins 0, with 0 ties and an average score difference of +31.30.
Qwen3.6-27B
Alibaba · 2026-04-22 · Reasoning model
Qwen3-32B
Alibaba · 2025-04-28 · Reasoning model
Qwen3.6-27B: 3 wins (100%) · Qwen3-32B: 0 wins (0%)
Benchmark scores
Grouped by capability, sorted by largest gap within each. 3 shared benchmarks.
General Knowledge
Qwen3.6-27B leads 2/2

| Benchmark | Qwen3.6-27B | Qwen3-32B | Diff |
|---|---|---|---|
| GPQA Diamond | 87.80 (rank 33 / 178, Thinking, No Tools) | 54.60 (rank 148 / 178, Normal, No Tools) | +33.20 |
| C-Eval | 91.40 (rank 5 / 9, Thinking, No Tools) | 83.30 (rank 9 / 9, Normal, No Tools) | +8.10 |
Coding and Software Engineer
Qwen3.6-27B leads 1/1

| Benchmark | Qwen3.6-27B | Qwen3-32B | Diff |
|---|---|---|---|
| LiveCodeBench | 83.90 (rank 19 / 120, Thinking, No Tools) | 31.30 (rank 114 / 120, Normal, No Tools) | +52.60 |
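The ordering described for these tables (grouped by capability, largest gap first within each group) can be sketched as follows. This is a minimal illustration, not the site's actual code: the `ROWS` structure and the `group_and_sort` helper are hypothetical names, and the rows are deliberately listed out of order to show the sort taking effect. The scores are the ones shown in the tables above.

```python
from collections import defaultdict

# (capability, benchmark, Qwen3.6-27B score, Qwen3-32B score), copied from the tables.
# ROWS is a hypothetical structure; the page's real record format is not shown.
ROWS = [
    ("General Knowledge", "C-Eval", 91.40, 83.30),
    ("General Knowledge", "GPQA Diamond", 87.80, 54.60),
    ("Coding and Software Engineer", "LiveCodeBench", 83.90, 31.30),
]


def group_and_sort(rows):
    """Group benchmarks by capability, then sort each group by absolute gap, descending."""
    groups = defaultdict(list)
    for capability, benchmark, score_a, score_b in rows:
        # Round to two decimals to avoid floating-point noise in the displayed diff.
        groups[capability].append((benchmark, round(score_a - score_b, 2)))
    for items in groups.values():
        items.sort(key=lambda item: abs(item[1]), reverse=True)
    return dict(groups)


grouped = group_and_sort(ROWS)
for capability, items in grouped.items():
    print(capability)
    for benchmark, diff in items:
        print(f"  {benchmark}: {diff:+.2f}")
```

With these inputs, GPQA Diamond (+33.20) sorts ahead of C-Eval (+8.10) within General Knowledge, matching the order on the page.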
Specs
| Field | Qwen3.6-27B | Qwen3-32B |
|---|---|---|
| Publisher | Alibaba | Alibaba |
| Release date | 2026-04-22 | 2025-04-28 |
| Model type | Reasoning model | Reasoning model |
| Architecture | Dense | Dense |
| Parameters | 27B | 32B |
| Context length | 128K | 128K |
| Max output | 16K | 16K |
Summary
- Qwen3.6-27B leads in: General Knowledge (2/2), Coding and Software Engineer (1/1)
On average across the 3 shared benchmarks, Qwen3.6-27B scores 31.30 points higher.
Largest single-benchmark gap: LiveCodeBench, where Qwen3.6-27B scores 83.90 vs 31.30 for Qwen3-32B (+52.60).
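The summary figures can be reproduced from the per-benchmark scores with a few lines of arithmetic. The sketch below assumes a win is simply a positive score difference and that the average is the unweighted mean of the three differences; the `SCORES` list and `summarize` function are hypothetical names introduced here for illustration.

```python
# (benchmark, Qwen3.6-27B score, Qwen3-32B score), copied from the tables on this page.
SCORES = [
    ("GPQA Diamond", 87.80, 54.60),
    ("C-Eval", 91.40, 83.30),
    ("LiveCodeBench", 83.90, 31.30),
]


def summarize(scores):
    """Return win/tie counts, mean difference, and the benchmark with the largest gap."""
    # Round each difference to two decimals to avoid floating-point noise.
    diffs = {name: round(a - b, 2) for name, a, b in scores}
    return {
        "wins_a": sum(d > 0 for d in diffs.values()),
        "wins_b": sum(d < 0 for d in diffs.values()),
        "ties": sum(d == 0 for d in diffs.values()),
        # Assumption: unweighted mean over all shared benchmarks.
        "average_diff": round(sum(diffs.values()) / len(diffs), 2),
        "largest_gap": max(diffs, key=lambda name: abs(diffs[name])),
        "diffs": diffs,
    }


summary = summarize(SCORES)
print(summary)
```

Under these assumptions the result agrees with the page: 3 wins to 0 with 0 ties, an average difference of (33.20 + 8.10 + 52.60) / 3 = 31.30, and LiveCodeBench as the largest gap.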
Page generated from structured model, pricing and benchmark records. No real-time LLM is used to write the prose.