DeepSeek-R1-0528-Qwen3-8B
DeepSeek-R1-0528-Qwen3-8B is an AI model published by DeepSeek-AI, released on 2025-05-30, for 推理大模型, with 80.0B parameters, and 64K tokens context length, requiring about 16GB storage, under the MIT License license.
Data sourced primarily from official releases (GitHub, Hugging Face, papers), then benchmark leaderboards, then third-party evaluators. Learn about our data methodology
DeepSeekAI使用DeepSeek-R1-0528对Qwen3-8B蒸馏得到的
欢迎关注 DataLearner 官方微信,获得最新 AI 技术推送
