Qwen3.7-Max-Preview

Name: Qwen3.7-Max-Preview
Price: 2.5 USD
Availability: InStock
Author: 阿里巴巴

Reasoning modelQwen3.7

Qwen3.7-Max-Preview

Release date: 2026-05-20Updated: 2026-05-21 21:58:53.6063,810

Live demoGitHubHugging FaceCompare

Parameters

Context length

Chinese support

Supported

Reasoning ability

Qwen3.7-Max 是阿里云通义团队于2026年5月发布的闭源旗舰模型，定位为 Agent 工作流基座。模型在代码 Agent、通用 Agent 及长程自主执行方向进行了系统强化，在 GPQA Diamond（92.4）、HLE（41.4）、SWE-Pro（60.6）、MCP-Atlas（76.4）等主要基准上达到同批对比模型最高分，推理和 Agent 能力整体持平或小幅超越 Claude Opus 4.6 Max。官方实测显示模型可在未知硬件架构上持续自主运行 35 小时、执行逾千次工具调用，实现 10 倍算子加速。当前仅通过阿里云百炼平台 API 提供服务，兼容 OpenAI 与 Anthropic 两种调用协议。

Data sourced primarily from official releases (GitHub, Hugging Face, papers), then benchmark leaderboards, then third-party evaluators. Learn about our data methodology

Qwen3.7-Max-Preview

Model basics

Reasoning traces

Supported

Thinking modes

Thinking Mode (Default)Standard Mode

Context length

1M tokens

Max output length

64K tokens

Model type

Reasoning model

Modality (in / out)

No data

Release date

2026-05-20

Model file size

No data

MoE architecture

Yes

Total params / Active params

1T / N/A

Knowledge cutoff

No data

Qwen3.7-Max-Preview

Open source & experience

Code license

不开源

Weights license

不开源

GitHub repo

GitHub link unavailable

Hugging Face

Hugging Face link unavailable

Live demo

https://qwen.ai

Qwen3.7-Max-Preview

Official resources

Paper

Qwen3.7: The Agent Frontier

DataLearnerAI blog

No blog post yet

Qwen3.7-Max-Preview

API details

API speed

3/5

💡Default unit: $/1M tokens. If vendors use other units, follow their published pricing.

Learn about pricing modes

Standard

Type	Condition	Input	Output
Text	-	$2.50/ 1M	$7.50/ 1M

Cache PricingPrompt Cache

Type	TTL	Write	Read
Text	5m	$3.13/ 1M	$0.250/ 1M

Qwen3.7-Max-Preview

Benchmark Results

Qwen3.7-Max-Preview currently shows benchmark results led by MMLU Pro (4 / 126, score 89.60), LiveCodeBench (4 / 120, score 91.60), GPQA Diamond (11 / 179, score 92.40). This page also consolidates core specs, context limits, and API pricing so you can evaluate the model from benchmark results and deployment constraints together.