DE

DeepSeek-R1-0528-Qwen3-8B

Reasoning modelDeepSeek R1 Distill

DeepSeek-R1-0528-Qwen3-8B

Release date: 2025-05-30Updated: 2025-05-30 11:10:231,370
Live demoGitHubHugging FaceCompare
Parameters
8B
Context length
64K
Chinese support
Supported
Reasoning ability

DeepSeek-R1-0528-Qwen3-8B is an AI model published by DeepSeek-AI, released on 2025-05-30, for Reasoning model, with 8B parameters, and 64K context length, requiring about 16GB storage, under the MIT License license.

Data sourced primarily from official releases (GitHub, Hugging Face, papers), then benchmark leaderboards, then third-party evaluators. Learn about our data methodology

DeepSeek-R1-0528-Qwen3-8B

Model basics

Reasoning traces
Supported
Thinking modes
Thinking modes not supported
Context length
64K tokens
Max output length
40K tokens
Model type
Reasoning model
Modality (in / out)
Text → Text
Release date
2025-05-30
Model file size
16GB
MoE architecture
No
Total params / Active params
8B / N/A
Knowledge cutoff
No data
DeepSeek-R1-0528-Qwen3-8B

Open source & experience

Code license
Weights license
MIT License- 免费商用授权
GitHub repo
GitHub link unavailable
Live demo
No live demo
DeepSeek-R1-0528-Qwen3-8B

Official resources

Paper
No paper available
DataLearnerAI blog
No blog post yet
DeepSeek-R1-0528-Qwen3-8B

API details

API speed
4/5
No public API pricing yet.
DeepSeek-R1-0528-Qwen3-8B

Benchmark Results

No benchmark data to show.

Compare with other models

No curated comparisons for this model yet.

Want a custom combination? Open the compare tool

DeepSeek-R1-0528-Qwen3-8B

Publisher

DeepSeek-R1-0528-Qwen3-8B

Model Overview

DeepSeek-R1-0528-Qwen3-8B is an AI model published by DeepSeek-AI, released on 2025-05-30, for Reasoning model, with 8B parameters, and 64K context length, requiring about 16GB storage, under the MIT License license.

DataLearner on WeChat

Follow DataLearner on WeChat for AI model updates and research notes.

DataLearner WeChat QR code