DeepSeek-R1-Distill-Qwen-7B
DeepSeek-R1-Distill-Qwen-7B is a reasoning model published by DeepSeek-AI and released on 2025-01-20. It has about 7B parameters and a 128K-token context length, requires about 14 GB of storage, and is distributed under the MIT License.
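The roughly 14 GB storage figure is what you would expect from about 7 billion parameters stored at 16-bit precision (2 bytes per parameter). The following is a minimal back-of-the-envelope sketch, not an official sizing tool. The helper name `estimate_weight_storage_gb`, the 16-bit default, and the 4-bit case are assumptions for illustration. The estimate covers weights only and ignores tokenizer files, activations, and KV cache.

```python
def estimate_weight_storage_gb(num_params: float, bytes_per_param: float = 2.0) -> float:
    """Approximate size of the model weights in decimal gigabytes.

    Assumption: every parameter is stored at the same precision.
    The default of 2 bytes corresponds to FP16/BF16 weights.
    """
    return num_params * bytes_per_param / 1e9


# ~7 billion parameters, as implied by the "7B" in the model name.
fp16_gb = estimate_weight_storage_gb(7e9)       # 16-bit weights
int4_gb = estimate_weight_storage_gb(7e9, 0.5)  # hypothetical 4-bit quantization

print(f"16-bit: ~{fp16_gb:.0f} GB, 4-bit: ~{int4_gb:.1f} GB")
```

Under these assumptions the 16-bit estimate comes out to about 14 GB, which matches the storage figure listed for this model.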
Data is sourced primarily from official releases (GitHub, Hugging Face, papers), then from benchmark leaderboards, then from third-party evaluators.
Current benchmark results for DeepSeek-R1-Distill-Qwen-7B include AIME 2024 (45 / 62, score 53.30), MATH-500 (32 / 44, score 91.40), and GPQA Diamond (151 / 175, score 49.50). This page also consolidates core specs, context limits, and API pricing, so you can weigh benchmark results against deployment constraints.