Qwen2.5-3B

Foundation model · Qwen2.5

Release date: 2024-09-18
Updated: 2024-09-21 11:23:26
Parameters: 3B
Context length: 32K
Chinese support: Supported

Qwen2.5-3B is a foundation model published by Alibaba (阿里巴巴) and released on 2024-09-18. It has 3B parameters and a 32K context length, requires about 6GB of storage, is distributed under the Tongyi Qianwen RESEARCH LICENSE AGREEMENT, and scores 79.10 on GSM8K.

Data sourced primarily from official releases (GitHub, Hugging Face, papers), then benchmark leaderboards, then third-party evaluators.

Model basics

Reasoning traces: Not supported
Thinking modes: Not supported
Context length: 32K tokens
Max output length: No data
Model type: Foundation model
Modality (in / out): No data
Release date: 2024-09-18
Model file size: 6GB
MoE architecture: No
Total params / Active params: 3B / N/A
Knowledge cutoff: No data
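
The listed file size is consistent with a simple back-of-the-envelope estimate: parameter count times bytes per parameter. The sketch below assumes the weights are stored in a 16-bit format (2 bytes per parameter), which this page does not state, and uses the nominal 3B figure rather than an exact parameter count. The helper name `estimate_weight_size_gb` is hypothetical.

```python
def estimate_weight_size_gb(num_params: float, bytes_per_param: int = 2) -> float:
    """Rough size of the raw weights in decimal gigabytes (1 GB = 1e9 bytes).

    Ignores tokenizer files, config files, and any container overhead.
    """
    return num_params * bytes_per_param / 1e9


# Nominal figure from the page; the exact parameter count may differ.
NOMINAL_PARAMS = 3e9

# Assumption: 16-bit weights (2 bytes each). 32-bit is shown for contrast.
size_16bit = estimate_weight_size_gb(NOMINAL_PARAMS, bytes_per_param=2)
size_32bit = estimate_weight_size_gb(NOMINAL_PARAMS, bytes_per_param=4)

print(f"16-bit weights: ~{size_16bit:.0f} GB")
print(f"32-bit weights: ~{size_32bit:.0f} GB")
```

Under the 16-bit assumption the estimate comes to about 6 GB, matching the listed size. Memory needed at inference time would be higher, since activations and the KV cache are not counted here.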
Open source & experience

Code license
Weights license
Live demo: No live demo
Official resources

DataLearnerAI blog: No blog post yet
API details

API speed: No data
No public API pricing yet.
Benchmark Results

The headline benchmark results for Qwen2.5-3B are GSM8K (score 79.10, rank 17 of 26), BBH (score 56.30, rank 16 of 20), and MBPP (score 57.10, rank 24 of 28). This page also consolidates core specs and context limits, so benchmark results can be weighed alongside deployment constraints; no public API pricing is listed yet.
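
Because each benchmark is ranked against a different number of models, raw ranks are hard to compare directly. As a rough illustration, the sketch below converts the three rank/total pairs quoted above into the share of other ranked entries the model places ahead of. It assumes rank 1 is the best position and that each total counts the models listed for that benchmark on this site; the function name `fraction_outranked` is hypothetical.

```python
# Scores and ranks as quoted on this page.
results = {
    "GSM8K": {"score": 79.10, "rank": 17, "total": 26},
    "BBH": {"score": 56.30, "rank": 16, "total": 20},
    "MBPP": {"score": 57.10, "rank": 24, "total": 28},
}


def fraction_outranked(rank: int, total: int) -> float:
    """Share of the *other* ranked entries this model places ahead of.

    Assumes rank 1 is best, so rank 1 gives 1.0 and rank == total gives 0.0.
    """
    if total < 2 or not 1 <= rank <= total:
        raise ValueError("need total >= 2 and 1 <= rank <= total")
    return (total - rank) / (total - 1)


for name, r in results.items():
    share = fraction_outranked(r["rank"], r["total"])
    print(f"{name}: score {r['score']:.2f}, rank {r['rank']} of {r['total']}, "
          f"ahead of {share:.0%} of other entries")
```

This only describes relative position within the models tracked here; it says nothing about score gaps between neighbouring ranks.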

General Knowledge

4 evaluations
Benchmark / mode       Score    Rank / total
[name not captured]    65.60    63 / 65
BBH                    56.30    16 / 20
[name not captured]    34.60    123 / 126
[name not captured]    24.30    176 / 179

Math and Reasoning

2 evaluations
Benchmark / mode       Score    Rank / total
GSM8K                  79.10    17 / 26
[name not captured]    42.60    37 / 42

Coding and Software Engineering

2 evaluations
Benchmark / mode       Score    Rank / total
MBPP                   57.10    24 / 28
[name not captured]    42.10    34 / 39

Compare with other models

No curated comparisons for this model yet.

Publisher

Alibaba (阿里巴巴)

