Qwen3.7-Max-Preview
Qwen3.7-Max 是阿里云通义团队于2026年5月发布的闭源旗舰模型,定位为 Agent 工作流基座。模型在代码 Agent、通用 Agent 及长程自主执行方向进行了系统强化,在 GPQA Diamond(92.4)、HLE(41.4)、SWE-Pro(60.6)、MCP-Atlas(76.4)等主要基准上达到同批对比模型最高分,推理和 Agent 能力整体持平或小幅超越 Claude Opus 4.6 Max。官方实测显示模型可在未知硬件架构上持续自主运行 35 小时、执行逾千次工具调用,实现 10 倍算子加速。当前仅通过阿里云百炼平台 API 提供服务,兼容 OpenAI 与 Anthropic 两种调用协议。
Data sourced primarily from official releases (GitHub, Hugging Face, papers), then benchmark leaderboards, then third-party evaluators. Learn about our data methodology
Model basics
Open source & experience
Official resources
API details
| Type | Condition | Input | Output |
|---|---|---|---|
| Text | - | $2.50/ 1M | $7.50/ 1M |
| Type | TTL | Write | Read |
|---|---|---|---|
| Text | 5m | $3.13/ 1M | $0.250/ 1M |
Benchmark Results
Qwen3.7-Max-Preview currently shows benchmark results led by MMLU Pro (4 / 126, score 89.60), LiveCodeBench (4 / 120, score 91.60), GPQA Diamond (11 / 179, score 92.40). This page also consolidates core specs, context limits, and API pricing so you can evaluate the model from benchmark results and deployment constraints together.
General Knowledge
4 evaluationsCoding and Software Engineer
4 evaluationsCompare with other models
- Earlier versionQwen3.7-Max-Preview vs Qwen3.6-Max-Preview10 benchmarks
- Peer modelQwen3.7-Max-Preview vs DeepSeek-V4-Pro9 benchmarks
- Peer modelQwen3.7-Max-Preview vs Kimi K2.68 benchmarks
- Earlier versionQwen3.7-Max-Preview vs Qwen3-Max-Thinking7 benchmarks
- Peer modelQwen3.7-Max-Preview vs GLM 5.15 benchmarks
Want a custom combination? Open the compare tool
Publisher
Model Overview
Qwen3.7-Max 是阿里云通义团队于2026年5月发布的闭源旗舰模型,定位为 Agent 工作流基座。模型在代码 Agent、通用 Agent 及长程自主执行方向进行了系统强化,在 GPQA Diamond(92.4)、HLE(41.4)、SWE-Pro(60.6)、MCP-Atlas(76.4)等主要基准上达到同批对比模型最高分,推理和 Agent 能力整体持平或小幅超越 Claude Opus 4.6 Max。官方实测显示模型可在未知硬件架构上持续自主运行 35 小时、执行逾千次工具调用,实现 10 倍算子加速。当前仅通过阿里云百炼平台 API 提供服务,兼容 OpenAI 与 Anthropic 两种调用协议。
DataLearner on WeChat
Follow DataLearner on WeChat for AI model updates and research notes.
