DeepSeek-R1

Name: DeepSeek-R1
Price: 0.55 USD
Availability: InStock
Author: DeepSeek-AI

Reasoning modelDeepSeek R1DeepSeek R1

DeepSeek-R1

Release date: 2025-01-20Updated: 2025-03-21 11:14:181,823

Live demoGitHubHugging Face Compare

Parameters

671B

Context length

128K

Chinese support

Supported

Reasoning ability

DeepSeek-R1 is an AI model published by DeepSeek-AI, released on 2025-01-20, for Reasoning model, with 671B parameters, and 128K context length, requiring about 134GB storage, with a 97.30 score on MATH-500.

Data sourced primarily from official releases (GitHub, Hugging Face, papers), then benchmark leaderboards, then third-party evaluators. Learn about our data methodology

DeepSeek-R1

Model basics

Reasoning traces

Supported

Thinking modes

Thinking modes not supported

Context length

128K tokens

Max output length

No data

Model type

Reasoning model

Modality (in / out)

No data

Release date

2025-01-20

Model file size

134GB

MoE architecture

Total params / Active params

671B / N/A

Knowledge cutoff

No data

DeepSeek-R1

Open source & experience

Code license

MIT License

Weights license

MIT License- Commercial use permitted

GitHub repo

GitHub link unavailable

Hugging Face

https://huggingface.co/deepseek-ai/DeepSeek-R1

Live demo

No live demo

DeepSeek-R1

Official resources

Paper

DataLearnerAI blog

DeepSeek-R1

API details

API speed

No data

💡Default unit: $/1M tokens. If vendors use other units, follow their published pricing.

Learn about pricing modes

Standard

Type	Condition	Input	Output
Text	-	$0.550/ 1M	$2.19/ 1M

Cache PricingPrompt Cache

Type	TTL	Write	Read
Text	-	$0.550/ 1M	$0.140/ 1M

DeepSeek-R1

Benchmark Results

DeepSeek-R1 currently shows benchmark results led by MMLU (8 / 66, score 90.80), MMLU Pro (38 / 132, score 84), MATH-500 (13 / 44, score 97.30). This page also consolidates core specs, context limits, and API pricing so you can evaluate the model from benchmark results and deployment constraints together.