DataLearner logoDataLearnerAI
Latest AI Insights
Model Evaluations
Model Directory
Model Comparison
Resource Center
Tool Directory

加载中...

DataLearner logoDataLearner AI

A knowledge platform focused on LLM benchmarking, datasets, and practical instruction with continuously updated capability maps.

产品

  • Leaderboards
  • 模型对比
  • Datasets

资源

  • Tutorials
  • Editorial
  • Tool directory

关于

  • 关于我们
  • 隐私政策
  • 数据收集方法
  • 联系我们

© 2026 DataLearner AI. DataLearner curates industry data and case studies so researchers, enterprises, and developers can rely on trustworthy intelligence.

隐私政策服务条款
Page navigation
目录
Model catalogDeepSeek-R1-Distill-Qwen-7B
DE

DeepSeek-R1-Distill-Qwen-7B

DeepSeek-R1-Distill-Qwen-7B

Release date: 2025-01-20更新于: 2025-02-27 22:11:471,078
Live demoGitHubHugging FaceCompare
Parameters
70.0亿
Context length
128K
Chinese support
Supported
Reasoning ability

Data sourced primarily from official releases (GitHub, Hugging Face, papers), then benchmark leaderboards, then third-party evaluators. Learn about our data methodology

DeepSeek-R1-Distill-Qwen-7B

Model basics

Reasoning traces
Supported
Context length
128K tokens
Max output length
No data
Model type
推理大模型
Release date
2025-01-20
Model file size
14GB
MoE architecture
No
Total params / Active params
70.0B / N/A
Knowledge cutoff
No data
Inference modes
No mode data
DeepSeek-R1-Distill-Qwen-7B

Open source & experience

Code license
MIT License
Weights license
MIT License- 免费商用授权
GitHub repo
GitHub link unavailable
Hugging Face
https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-7B
Live demo
No live demo
DeepSeek-R1-Distill-Qwen-7B

Official resources

Paper
DeepSeek-R1-Distill-Qwen-7B
DataLearnerAI blog
No blog post yet
DeepSeek-R1-Distill-Qwen-7B

API details

API speed
No data
No public API pricing yet.
DeepSeek-R1-Distill-Qwen-7B

Benchmark Results

综合评估

1 evaluations
Benchmark / mode
Score
Rank/total
GPQA DiamondNormal
49.50
126 / 150

数学推理

2 evaluations
Benchmark / mode
Score
Rank/total
MATH-500Normal
91.40
31 / 42
AIME 2024Normal
53.30
46 / 62
查看评测深度分析与其他模型对比
DeepSeek-R1-Distill-Qwen-7B

Publisher

DeepSeek-AI
DeepSeek-AI
View publisher details
DeepSeek-R1-Distill-Qwen-7B

Model Overview

DeepseekAI使用DeepSeek R1模型对Qwen2.5-math-7B模型蒸馏得到的大模型,使Qwen2.5-math-7B有了推理能力。

DataLearner 官方微信

欢迎关注 DataLearner 官方微信,获得最新 AI 技术推送

DataLearner 官方微信二维码