Qwen2.5-3B

Name: Qwen2.5-3B
Author: Alibaba

Type: Foundation model
Series: Qwen2.5

Release date: 2024-09-18
Updated: 2024-09-21 11:23:26

Parameters

3B

Context length

32K

Chinese support

Supported

Qwen2.5-3B is a foundation model published by Alibaba and released on 2024-09-18. It has 3B parameters and a 32K-token context length, requires about 6GB of storage, and is distributed under the Tongyi Qianwen RESEARCH LICENSE AGREEMENT. It scores 79.10 on GSM8K.

Data is sourced primarily from official releases (GitHub, Hugging Face, papers), then from benchmark leaderboards, then from third-party evaluators.

Model basics

Reasoning traces

Not supported

Thinking modes

Not supported

Context length

32K tokens

Max output length

No data

Model type

Foundation model

Modality (in / out)

No data

Release date

2024-09-18

Model file size

6GB

MoE architecture

Total params / Active params

3B / N/A

Knowledge cutoff

No data
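The listed 6GB file size is consistent with a simple back-of-the-envelope estimate: parameter count times bytes per parameter. The sketch below assumes the nominal 3B figure from this page, 16-bit weights (2 bytes each), and decimal gigabytes; none of these assumptions is stated on the page, and the helper name `estimate_size_gb` is purely illustrative.

```python
# Rough storage estimate: parameter count times bytes per parameter.
# Assumptions (not stated on the page): nominal 3e9 parameters,
# 16-bit weights for the published files, and 1 GB = 1e9 bytes.
NOMINAL_PARAMS = 3e9  # "3B" as listed on this page

BYTES_PER_PARAM = {"fp32": 4, "bf16": 2, "int8": 1}


def estimate_size_gb(num_params: float, dtype: str = "bf16") -> float:
    """Return the approximate weight size in decimal gigabytes."""
    return num_params * BYTES_PER_PARAM[dtype] / 1e9


if __name__ == "__main__":
    for dtype in BYTES_PER_PARAM:
        print(f"{dtype}: ~{estimate_size_gb(NOMINAL_PARAMS, dtype):.1f} GB")
```

Under these assumptions the 16-bit case lands at the listed figure of about 6GB; other precisions scale the estimate proportionally.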

Open source & experience

Code license

Apache 2.0

Weights license

Tongyi Qianwen RESEARCH LICENSE AGREEMENT - free for commercial use

GitHub repo

https://github.com/QwenLM/Qwen2.5

Hugging Face

https://huggingface.co/Qwen/Qwen2.5-3B

Live demo

No live demo
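Since no live demo is listed, one way to try the model is to load the Hugging Face checkpoint locally. The following is a minimal sketch, assuming the `transformers` and `torch` packages are installed and that the roughly 6GB of weights can be downloaded; the function name `complete` and the reading of "32K" as 32,768 tokens are assumptions made for illustration. Because this is a foundation model rather than a chat-tuned one, the sketch uses plain text completion.

```python
MODEL_ID = "Qwen/Qwen2.5-3B"  # repository name from the Hugging Face URL above
CONTEXT_TOKENS = 32 * 1024    # assumption: "32K" means 32,768 tokens


def complete(prompt: str, max_new_tokens: int = 64) -> str:
    """Continue `prompt` with the base model (plain completion, no chat template)."""
    # Imported lazily so this module can be loaded without the heavy dependencies.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(MODEL_ID, torch_dtype="auto")

    inputs = tokenizer(prompt, return_tensors="pt")
    prompt_tokens = inputs["input_ids"].shape[1]
    # Guard against requests that would not fit in the listed context length.
    if prompt_tokens + max_new_tokens > CONTEXT_TOKENS:
        raise ValueError("prompt plus requested output exceeds the context length")

    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)


if __name__ == "__main__":
    print(complete("The capital of France is"))
```

The first call downloads the weights; running on CPU works but is slow for a 3B model.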

Official resources

Paper

Qwen2.5-LLM: Extending the boundary of LLMs

DataLearnerAI blog

No blog post yet

API details

API speed

No data

No public API pricing yet.

Benchmark Results

Benchmark results for Qwen2.5-3B are led by GSM8K (score 79.10, ranked 17 of 26), BBH (score 56.30, ranked 16 of 20), and MBPP (score 57.10, ranked 24 of 28). This page also consolidates core specs, context limits, and API pricing, so you can weigh benchmark results and deployment constraints together.
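Raw scores on different benchmarks are not directly comparable, so the rank figures can be more informative. The sketch below assumes each pair such as 17 / 26 means "rank 17 out of 26 listed models" (an interpretation, not something the page states) and computes the fraction of listed models that Qwen2.5-3B outranks on each benchmark. The names `BenchmarkResult` and `fraction_outranked` are illustrative.

```python
from typing import NamedTuple


class BenchmarkResult(NamedTuple):
    name: str
    rank: int     # assumed: position on the leaderboard, 1 = best
    total: int    # assumed: number of models listed for the benchmark
    score: float


# Figures copied from the benchmark summary above.
RESULTS = [
    BenchmarkResult("GSM8K", 17, 26, 79.10),
    BenchmarkResult("BBH", 16, 20, 56.30),
    BenchmarkResult("MBPP", 24, 28, 57.10),
]


def fraction_outranked(result: BenchmarkResult) -> float:
    """Share of listed models that sit below this model on the leaderboard."""
    return (result.total - result.rank) / result.total


def by_relative_standing(results: list[BenchmarkResult]) -> list[BenchmarkResult]:
    """Sort benchmarks from strongest to weakest relative standing."""
    return sorted(results, key=fraction_outranked, reverse=True)


if __name__ == "__main__":
    for r in by_relative_standing(RESULTS):
        print(f"{r.name}: score {r.score:.2f}, outranks {fraction_outranked(r):.0%} of listed models")
```

Under this reading, GSM8K is the model's strongest relative result and MBPP its weakest, even though the MBPP score is numerically higher than the BBH score.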