Qwen3-4B-Thinking-2507

Chat LLM


Release date: 2025-08-06
Updated: 2025-08-07 10:45:36
Parameters
4.0B
Context length
256K
Chinese support
Supported
Reasoning ability

Qwen3-4B-Thinking-2507 is a chat LLM published by Alibaba and released on 2025-08-06. It has 4.0B parameters and a 256K-token context length, requires about 8.05GB of storage, and is available under the Apache 2.0 license.
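
The storage figure can be sanity-checked with simple arithmetic. The sketch below assumes roughly 4.0 billion parameters stored as 16-bit weights (2 bytes each). The page does not state the weight precision, so treat the result as an approximation of the listed 8.05GB, not an exact accounting.

```python
# Rough weight-storage estimate.
# Assumptions (not stated on this page): about 4.0e9 parameters,
# 2 bytes per parameter (16-bit weights), decimal gigabytes.
BYTES_PER_GB = 1e9


def weight_size_gb(num_params: float, bytes_per_param: float = 2.0) -> float:
    """Return the approximate size of the raw weights in GB."""
    return num_params * bytes_per_param / BYTES_PER_GB


estimate = weight_size_gb(4.0e9)
print(f"{estimate:.2f} GB")  # 8.00 GB
```

The small gap to 8.05GB would be consistent with a parameter count slightly above 4.0B plus file metadata, but that is a guess.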

Data sourced primarily from official releases (GitHub, Hugging Face, papers), then benchmark leaderboards, then third-party evaluators.


Model basics

Reasoning traces
Supported
Switchable thinking modes
Not supported (this variant always runs in thinking mode)
Context length
256K tokens
Max output length
16384 tokens
Model type
Chat LLM
Release date
2025-08-06
Model file size
8.05GB
MoE architecture
No
Total params / Active params
4.0B / N/A
Knowledge cutoff
No data
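
As a rough illustration of how the context length and max output length interact, the sketch below computes how many prompt tokens remain once output space is reserved. It assumes that prompt and output share one context window and that "256K" means 262,144 tokens. Neither point is stated on this page, and `max_prompt_tokens` is a hypothetical helper.

```python
# Token budget sketch.
# Assumptions: "256K tokens" is read as 256 * 1024, and the prompt and the
# generated output share the same context window.
CONTEXT_TOKENS = 256 * 1024
MAX_OUTPUT_TOKENS = 16384  # max output length listed above


def max_prompt_tokens(reserved_output: int = MAX_OUTPUT_TOKENS) -> int:
    """Return the prompt budget left after reserving output tokens."""
    if not 0 <= reserved_output <= MAX_OUTPUT_TOKENS:
        raise ValueError("reserved_output must be between 0 and the max output length")
    return CONTEXT_TOKENS - reserved_output


print(max_prompt_tokens())  # 245760
```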

Open source & experience

Code license
Apache 2.0
Weights license
Apache 2.0 (free for commercial use)
GitHub repo
https://github.com/QwenLM/Qwen3
Hugging Face
https://huggingface.co/Qwen/Qwen3-4B-Thinking-2507
Live demo
https://chat.qwen.ai/

Official resources

Paper
Qwen3: Think Deeper, Act Faster
DataLearnerAI blog
No blog post yet

API details

API speed
4/5
💡Default unit: $/1M tokens. If vendors use other units, follow their published pricing.
Standard pricing ($/1M tokens)
Modality | Input | Output
Text | $0.11 | $1.26
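
To make the pricing concrete, the sketch below estimates the cost of a single request from the listed rates. It assumes reasoning tokens are billed as output tokens, which this page does not confirm. `request_cost_usd` is a hypothetical helper, and the token counts are made up for illustration.

```python
# Per-request cost estimate from the listed prices ($ per 1M tokens).
# Assumption: reasoning (thinking) tokens are counted as output tokens.
INPUT_USD_PER_M = 0.11
OUTPUT_USD_PER_M = 1.26


def request_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated cost in USD for one request."""
    total = input_tokens * INPUT_USD_PER_M + output_tokens * OUTPUT_USD_PER_M
    return total / 1_000_000


# Illustrative request: 2,000 prompt tokens and 8,000 generated tokens.
cost = request_cost_usd(2_000, 8_000)
print(f"${cost:.4f}")  # $0.0103
```

Because the output rate is more than ten times the input rate, long reasoning traces are likely to dominate the bill under this assumption.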

Benchmark Results

Qwen3-4B-Thinking-2507 currently has three listed benchmark results: AIME2025 (score 81.30, rank 55 of 106), LiveCodeBench (score 55.20, rank 73 of 109), and GPQA Diamond (score 65.80, rank 116 of 166). This page also consolidates core specs, context limits, and API pricing so the model can be evaluated on benchmark results and deployment constraints together.


Category | Benchmark | Thinking mode | Score | Rank / total
General evaluation | GPQA Diamond | On | 65.80 | 116 / 166
Coding and software engineering | LiveCodeBench | On | 55.20 | 73 / 109
Mathematical reasoning | AIME2025 | On | 81.30 | 55 / 106
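
Ranks are easier to compare across benchmarks with different numbers of entries when expressed as a share of the field. The sketch below converts each rank above into the fraction of listed models that rank lower. It assumes rank 1 is the best score and that there are no ties, neither of which the page states.

```python
# Convert "rank / total" into the share of listed models ranked below.
# Assumptions: rank 1 is best, and ranks have no ties.
def share_ranked_below(rank: int, total: int) -> float:
    """Return the fraction of the listed models that rank lower."""
    if not 1 <= rank <= total:
        raise ValueError("rank must be between 1 and total")
    return (total - rank) / total


results = {
    "AIME2025": (55, 106),
    "LiveCodeBench": (73, 109),
    "GPQA Diamond": (116, 166),
}

for name, (rank, total) in results.items():
    print(f"{name}: {share_ranked_below(rank, total):.1%}")
# AIME2025: 48.1%
# LiveCodeBench: 33.0%
# GPQA Diamond: 30.1%
```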

Publisher

Alibaba

Model Overview

Qwen3-4B-2507 is Alibaba's updated release of Qwen3-4B. Compared with the April 28 release, this version splits the model into separate thinking and non-thinking variants.
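
Because the thinking variant emits a reasoning trace before its final answer, downstream code usually needs to separate the two. The sketch below is one minimal way to do that on raw text. It assumes the end of the trace is marked with a `</think>` tag and that an opening `<think>` tag may or may not be present. This tag convention is not stated on this page, and `split_reasoning` is a hypothetical helper.

```python
from typing import Tuple


def split_reasoning(text: str, end_tag: str = "</think>") -> Tuple[str, str]:
    """Split raw model output into (reasoning, answer) at the last end tag.

    Assumption: the reasoning trace ends with `end_tag`; if the tag is
    missing, the whole text is treated as the answer.
    """
    head, sep, tail = text.rpartition(end_tag)
    if not sep:
        return "", text.strip()
    reasoning = head.strip()
    # Drop an optional opening tag if the output includes one.
    if reasoning.startswith("<think>"):
        reasoning = reasoning[len("<think>"):].strip()
    return reasoning, tail.strip()


raw = "Let me add 2 and 2.\n</think>\n\nThe answer is 4."
reasoning, answer = split_reasoning(raw)
print(answer)  # The answer is 4.
```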
