Qwen3.6-27BvsGemini 3.0 Flash
Across 5 shared benchmarks, Gemini 3.0 Flash leads overall: Qwen3.6-27B wins 2, Gemini 3.0 Flash wins 3, with 0 ties and an average score difference of -3.04.
Qwen3.6-27B
阿里巴巴 · 2026-04-22 · Reasoning model
Gemini 3.0 Flash
Google Deep Mind · 2025-12-17 · AI model
Qwen3.6-27B2 wins(40%)(60%)3 winsGemini 3.0 Flash
Benchmark scores
Grouped by capability, sorted by largest gap within each. 5 shared benchmarks.
General Knowledge
Gemini 3.0 Flash 2/2| Benchmark | Qwen3.6-27B | Gemini 3.0 Flash | Diff |
|---|---|---|---|
| HLE | 2484 / 149Thinking (No Tools) | 43.5033 / 149thinking + 使用工具 | -19.50 |
| GPQA Diamond | 87.8030 / 175Thinking (No Tools) | 90.4015 / 175thinking | -2.60 |
AI Agent - Tool Usage
Qwen3.6-27B 1/1| Benchmark |
|---|
Specs
| Field | Qwen3.6-27B | Gemini 3.0 Flash |
|---|---|---|
| Publisher | 阿里巴巴 | Google Deep Mind |
| Release date | 2026-04-22 | 2025-12-17 |
| Model type | Reasoning model | AI model |
| Architecture | Dense | Dense |
| Parameters | 270.0 | 0.0 |
| Context length | 128K | 2000K |
| Max output | 16384 | 65536 |
API pricing
Prices use DataLearner records when available; missing fields are not inferred.
| Item | Qwen3.6-27B | Gemini 3.0 Flash |
|---|---|---|
| Text input | Not public | 0.5 美元/100万 tokens |
| Text output | Not public | 3 美元/100万 tokens |
| Cache read | Not public | 0.05 美元/100万 tokens |
One or both models have incomplete public pricing.
Summary
- Qwen3.6-27Bleads in:AI Agent - Tool Usage (1/1), Coding and Software Engineer (1/1)
- Gemini 3.0 Flashleads in:General Knowledge (2/2), Claw-style Agent Evaluation (1/1)
On average across the 5 shared benchmarks, Gemini 3.0 Flash scores 3.04 higher.
Largest single-benchmark gap: HLE — Qwen3.6-27B 24 vs Gemini 3.0 Flash 43.50 (-19.50).
Page generated from structured model, pricing and benchmark records. No real-time LLM is used to write the prose.