DataLearner logoDataLearnerAI
Latest AI Insights
Model Leaderboards
Benchmarks
Model Directory
Model Comparison
Resource Center
Tools
LanguageEnglish
DataLearner logoDataLearner AI

A knowledge platform focused on LLM benchmarking, datasets, and practical instruction with continuously updated capability maps.

Products

  • Leaderboards
  • Model comparison
  • Datasets

Resources

  • Tutorials
  • Editorial
  • Tool directory

Company

  • About
  • Privacy policy
  • Data methodology
  • Contact

© 2026 DataLearner AI. DataLearner curates industry data and case studies so researchers, enterprises, and developers can rely on trustworthy intelligence.

Privacy policyTerms of service
Page navigation
Page navigation
Model catalogDeepSeek-R1-Distill-Qwen-7B
DE

DeepSeek-R1-Distill-Qwen-7B

Reasoning modelDeepSeek-R1

DeepSeek-R1-Distill-Qwen-7B

Release date: 2025-01-20Updated: 2025-02-27 22:11:471,261
Live demoGitHubHugging FaceCompare
Parameters
7B
Context length
128K (131072)
Chinese support
Supported
Reasoning ability

DeepSeek-R1-Distill-Qwen-7B is an AI model published by DeepSeek-AI, released on 2025-01-20, for Reasoning model, with 70.0B parameters, and 128K tokens context length, requiring about 14GB storage, under the MIT License license.

Data sourced primarily from official releases (GitHub, Hugging Face, papers), then benchmark leaderboards, then third-party evaluators. Learn about our data methodology

DeepSeek-R1-Distill-Qwen-7B

Model basics

Reasoning traces
Supported
Thinking modes
Thinking modes not supported
Context length
128K (131072) tokens
Max output length
No data
Model type
Reasoning model
Release date
2025-01-20
Model file size
14GB
MoE architecture
No
Total params / Active params
7B / N/A
Knowledge cutoff
No data
DeepSeek-R1-Distill-Qwen-7B

Open source & experience

Code license
MIT License
Weights license
MIT License- 免费商用授权
GitHub repo
GitHub link unavailable
Hugging Face
https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-7B
Live demo
No live demo
DeepSeek-R1-Distill-Qwen-7B

Official resources

Paper
DeepSeek-R1-Distill-Qwen-7B
DataLearnerAI blog
No blog post yet
DeepSeek-R1-Distill-Qwen-7B

API details

API speed
No data
No public API pricing yet.
DeepSeek-R1-Distill-Qwen-7B

Benchmark Results

DeepSeek-R1-Distill-Qwen-7B currently shows benchmark results led by AIME 2024 (45 / 62, score 53.30), MATH-500 (32 / 44, score 91.40), GPQA Diamond (153 / 177, score 49.50). This page also consolidates core specs, context limits, and API pricing so you can evaluate the model from benchmark results and deployment constraints together.

Thinking

General Knowledge

1 evaluations
Benchmark / mode
Score
Rank/total
GPQA Diamond
Standard Mode
49.50
153 / 177

Math and Reasoning

2 evaluations
Benchmark / mode
Score
Rank/total
MATH-500
Standard Mode
91.40
32 / 44
AIME 2024
Standard Mode
53.30
45 / 62
View benchmark analysisCompare with other models
DeepSeek-R1-Distill-Qwen-7B

Publisher

DeepSeek-AI
DeepSeek-AI
View publisher details
DeepSeek-R1-Distill-Qwen-7B

Model Overview

DeepSeek-R1-Distill-Qwen-7B is an AI model published by DeepSeek-AI, released on 2025-01-20, for Reasoning model, with 70.0B parameters, and 128K tokens context length, requiring about 14GB storage, under the MIT License license.

DataLearner on WeChat

Follow DataLearner on WeChat for AI model updates and research notes.

DataLearner WeChat QR code

Compare with other models

No curated comparisons for this model yet.

Want a custom combination? Open the compare tool