Qwen3-235B-A22B-Thinking-2507

Name: Qwen3-235B-A22B-Thinking-2507
Availability: InStock
Author: 阿里巴巴

推理大模型

Release date: 2025-07-25更新于: 2025-07-27 23:27:051,281

Live demo GitHub Hugging Face Compare

Parameters

2350.0亿

Context length

256K

Chinese support

Supported

Reasoning ability

Qwen3-235B-A22B-Thinking-2507 is an AI model published by 阿里巴巴, released on 2025-07-25, for 推理大模型, with 2350.0B parameters, and 256K tokens context length, requiring about 470.77 GB storage, under the Apache 2.0 license.

Data sourced primarily from official releases (GitHub, Hugging Face, papers), then benchmark leaderboards, then third-party evaluators. Learn about our data methodology

Qwen3-235B-A22B-Thinking-2507

Model basics

Reasoning traces

Supported

Thinking modes

Thinking modes not supported

Context length

256K tokens

Max output length

32768 tokens

Model type

Qwen3-235B-A22B-Thinking-2507

Open source & experience

Code license

Apache 2.0

Weights license

Apache 2.0- 免费商用授权

GitHub repo

https://github.com/QwenLM/Qwen3

Hugging Face

https://huggingface.co/Qwen/Qwen3-235B-A22B-Thinking-2507

Qwen3-235B-A22B-Thinking-2507

Official resources

Paper

Qwen3-235B-A22B-Instruct-2507

DataLearnerAI blog

阿里发布Qwen3小幅更新版本，放弃混合思考模式，发布全新的2个版本Qwen3-235B-A22B-2507模型，1/5的参数，性能直逼Kimi K2，推理模式版本评测结果接近o3

Qwen3-235B-A22B-Thinking-2507

API details

API speed

3/5

Qwen3-235B-A22B-Thinking-2507

Benchmark Results

Qwen3-235B-A22B-Thinking-2507 currently shows benchmark results led by MMLU Pro (26 / 116, score 84.40), Creative Writing (5 / 22, score 86.10), LiveCodeBench (32 / 109, score 74.10). This page also consolidates core specs, context limits, and API pricing so you can evaluate the model from benchmark results and deployment constraints together.

综合评估

4 evaluations

Benchmark / mode

Score

Rank/total

MMLU Pro

Medium

84.40

26 / 116

GPQA Diamond

Medium

81.10

56 / 166

LiveBench

Medium

69.11

22 / 51

HLE

Medium

18.20

82 / 128

编程与软件工程

1 evaluations

Benchmark / mode

Score

Rank/total

LiveCodeBench

Medium

74.10

32 / 109

数学推理

1 evaluations

Benchmark / mode

Score

Rank/total

AIME2025

Medium

92.30

33 / 107

写作和创作

1 evaluations

Benchmark / mode

Score

Rank/total

Creative Writing

Medium

86.10

5 / 22

View benchmark analysis Compare with other models

Qwen3-235B-A22B-Thinking-2507

Model variants & downloads

Variant name	Version type	Quantization	Model size	HuggingFace link
Qwen3-235B-A22B-Thinking-2507-FP8ℹ️	Instruct	FP8	236.45 GB	Download link

Qwen3-235B-A22B-Thinking-2507

Publisher

阿里巴巴

View publisher details

Qwen3-235B-A22B-Thinking-2507

Model Overview

阿里巴巴开源的Qwen3-235B-A22B模型的升级版本，最早的Qwen3-235B-A22B模型是在2025年4月28日随着Qwen3系列一起发布，当时是推理和非推理模式混合的架构模型，后来阿里发现这个模式不好，因此在2025年7月份发布了更新版的模型，即不支持推理模式的Qwen3-235B-A22B-2507和支持推理模式的Qwen3-235B-A22B-Thinking-2507。

Qwen3-235B-A22B-Thinking-2507最多可以支持80K的推理过程长度，最高支持32K的答案输出，是当前推理过程最长的模型之一！

DataLearner 官方微信

欢迎关注 DataLearner 官方微信，获得最新 AI 技术推送