DeepSeek-R1-Distill-Llama-70B
DeepSeek-R1-Distill-Llama-70B is an AI model published by DeepSeek-AI, released on 2025-01-20, for Reasoning model, with 700.0B parameters, and 128K tokens context length, requiring about 140GB storage, under the MIT License license.
Data sourced primarily from official releases (GitHub, Hugging Face, papers), then benchmark leaderboards, then third-party evaluators. Learn about our data methodology
DeepSeek-R1-Distill-Llama-70B currently shows benchmark results led by MATH-500 (27 / 44, score 94.50), GPQA Diamond (126 / 175, score 65.20). This page also consolidates core specs, context limits, and API pricing so you can evaluate the model from benchmark results and deployment constraints together.
DeepSeek-R1-Distill-Llama-70B is an AI model published by DeepSeek-AI, released on 2025-01-20, for Reasoning model, with 700.0B parameters, and 128K tokens context length, requiring about 140GB storage, under the MIT License license.
Follow DataLearner on WeChat for AI model updates and research notes.
