Qwen3.5-27B Benchmark Details
Qwen3.5-27B's leading benchmark results are Pinch Bench (rank 2 / 37, score 90), IF Bench (rank 2 / 27, score 76.50), and MMLU Pro (rank 11 / 116, score 86.10). This page also compares it with 1 competitor model and 2 predecessor or same-series models, including performance and pricing views when available. 1 source link is attached for reference.
Benchmark Results
General evaluation: 5 evaluations
Coding and software engineering: 3 evaluations
AI Agent (tool use): 2 evaluations
OpenClaw comprehensive agent capability evaluation: 2 evaluations
Competitor Comparison
Benchmark scores for Qwen3.5-27B compared against top models in its class
Benchmark Score Comparison
5 benchmarks with comparable scores
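| Benchmark | Qwen3.5-27B (this model) | Gemma 4 31B |
|---|---|---|
| GPQA Diamond (general evaluation) | 85.50 (thinking mode, no tools) | 84.30 (thinking mode, no tools) |
| HLE (general evaluation) | 48.50 (thinking mode, with tools) | 26.50 (thinking mode, tools + web access) |
| MMLU Pro (general evaluation) | 86.10 (thinking mode, no tools) | 85.20 (thinking mode, no tools) |
| LiveCodeBench (coding and software engineering) | 80.70 (thinking mode, with tools) | 80.00 (thinking mode, no tools) |
| τ²-Bench (agent capability evaluation) | 79.00 (thinking mode, with tools) | 76.90 (thinking mode, with tools) |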
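As a rough illustration of how to read the comparison, the per-benchmark gap between the two models can be computed directly from the scores listed above. This is only a sketch: the dictionary layout and the `score_gaps` helper are hypothetical names introduced here, and because the HLE and LiveCodeBench rows were run in different modes for the two models, those gaps are not strictly like-for-like.

```python
# Scores copied from the comparison table: (Qwen3.5-27B, Gemma 4 31B).
SCORES = {
    "GPQA Diamond": (85.50, 84.30),
    "HLE": (48.50, 26.50),
    "MMLU Pro": (86.10, 85.20),
    "LiveCodeBench": (80.70, 80.00),
    "τ²-Bench": (79.00, 76.90),
}

def score_gaps(scores: dict[str, tuple[float, float]]) -> dict[str, float]:
    """Return the first model's score minus the second's, rounded to 2 decimals."""
    return {name: round(a - b, 2) for name, (a, b) in scores.items()}

gaps = score_gaps(SCORES)
for name, gap in gaps.items():
    print(f"{name}: {gap:+.2f}")
```

Rounding is applied only to hide floating-point noise in the subtraction; the underlying scores are unchanged.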
Standard API Pricing: Qwen3.5-27B vs. Peer Models
Shows standard text input and output pricing side by side for each model. If extended-context pricing exists, the chart keeps the base rate and explains the threshold below.
Source: DataLearnerAI. Standard text prices shown here use the default supplier.
Version History
How each version of the Qwen3.5-27B series stacks up on benchmark tests
Benchmark Score Comparison
3 benchmarks with comparable scores
| Benchmark | Qwen3.5-27B (this model) | Qwen3-32B | Qwen2.5-32B |
|---|---|---|---|
| GPQA Diamond (general evaluation) | 85.50 (thinking mode, no tools) | 68.40 (thinking) | -- |
| MMLU Pro (general evaluation) | 86.10 (thinking mode, no tools) | -- | 69.23 (normal) |
| LiveCodeBench (coding and software engineering) | 80.70 (thinking mode, with tools) | 65.70 (normal) | 51.20 (normal) |
Standard API Pricing Across the Qwen3.5-27B Series
Shows standard text input and output pricing side by side for each model. If extended-context pricing exists, the chart keeps the base rate and explains the threshold below.
Source: DataLearnerAI. Standard text prices shown here use the default supplier.
These models use different currencies or billing units, so the page falls back to raw price values instead of a shared bar chart.
| Model | Supplier | Standard input | Standard output | Base price applies to |
|---|---|---|---|---|
| Qwen3-32B | -- | 0.7 USD / 1M tokens | 2.8 USD / 1M tokens | -- |
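To make the per-million-token rates concrete, the following sketch estimates the cost of a single workload at the Qwen3-32B standard prices listed above. The function name and the example token counts are hypothetical and chosen purely for illustration; actual billing may differ by supplier or context length.

```python
# Standard prices from the table above, in USD per 1 million tokens (Qwen3-32B).
INPUT_USD_PER_M = 0.7
OUTPUT_USD_PER_M = 2.8

def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost in USD of a workload at the standard rates."""
    total = input_tokens * INPUT_USD_PER_M + output_tokens * OUTPUT_USD_PER_M
    return total / 1_000_000

# Hypothetical workload: 200,000 input tokens and 50,000 output tokens.
cost = estimate_cost_usd(200_000, 50_000)
print(f"{cost:.2f} USD")  # prints "0.28 USD"
```

Output tokens are priced at four times the input rate here, so output-heavy workloads dominate the bill even at modest volumes.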
Series Overview
See how each version of the Qwen3.5-27B series performs across major benchmarks.
| Benchmark | Qwen2.5-32B (9/18/2024) | Qwen3-32B (4/28/2025) | Qwen3.5-27B (2/25/2026) |
|---|---|---|---|