DE

DeepSeek V2.5

Foundation modelDeepSeek VDeepSeek V2.5

DeepSeek V2.5 - 236B

Release date: 2024-09-05Updated: 2024-11-23 18:18:231,066
Parameters
236B
Context length
128K
Chinese support
Supported
Reasoning ability

DeepSeek V2.5 - 236B is an AI model published by DeepSeek-AI, released on 2024-09-05, for Foundation model, with 236B parameters, and 128K context length, requiring about 133GB storage, under the DEEPSEEK LICENSE AGREEMENT license.

Data sourced primarily from official releases (GitHub, Hugging Face, papers), then benchmark leaderboards, then third-party evaluators. Learn about our data methodology

DeepSeek V2.5

Model basics

Reasoning traces
Not supported
Thinking modes
Thinking modes not supported
Context length
128K tokens
Max output length
No data
Model type
Foundation model
Modality (in / out)
No data
Release date
2024-09-05
Model file size
133GB
MoE architecture
No
Total params / Active params
236B / N/A
Knowledge cutoff
No data
DeepSeek V2.5

Open source & experience

DeepSeek V2.5

Official resources

DeepSeek V2.5

API details

API speed
No data
No public API pricing yet.
DeepSeek V2.5

Benchmark Results

No benchmark data to show.

Compare with other models

No curated comparisons for this model yet.

Want a custom combination? Open the compare tool

DeepSeek V2.5

Publisher

DeepSeek V2.5 - 236B

Model Overview

DeepSeek V2.5 - 236B is an AI model published by DeepSeek-AI, released on 2024-09-05, for Foundation model, with 236B parameters, and 128K context length, requiring about 133GB storage, under the DEEPSEEK LICENSE AGREEMENT license.

DataLearner on WeChat

Follow DataLearner on WeChat for AI model updates and research notes.

DataLearner WeChat QR code