DeepSeekMoE 145B Chat

Chat model, DeepSeekMoE

Release date: 2024-01-11
Updated: 2024-01-11 14:41:10
Parameters: 144.6B
Context length: 4K
Chinese support: Supported

DeepSeekMoE 145B Chat is a chat model published by DeepSeek-AI and released on 2024-01-11. It has 144.6B parameters and a 4K context length, requires about 290GB of storage, and is distributed under the DEEPSEEK LICENSE AGREEMENT.
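
The storage figure of about 290GB is consistent with a simple back-of-the-envelope estimate. The sketch below assumes the weights are stored at 16 bits (2 bytes) per parameter and that GB means 10^9 bytes; the listing states neither, so both are assumptions, and the function name is hypothetical.

```python
def estimate_weight_storage_gb(num_params: float, bytes_per_param: int = 2) -> float:
    """Estimate raw weight storage in decimal gigabytes (1 GB = 1e9 bytes).

    bytes_per_param=2 assumes 16-bit weights (e.g. bf16 or fp16); this
    precision is an assumption, not something the listing states.
    """
    return num_params * bytes_per_param / 1e9


# Total parameter count taken from the listing (144.6B).
TOTAL_PARAMS = 144.6e9

size_gb = estimate_weight_storage_gb(TOTAL_PARAMS)
print(f"Estimated weight storage: {size_gb:.1f} GB")
```

Under these assumptions the estimate comes to roughly 289 GB, in line with the listed figure. Real checkpoints also carry tokenizer and config files, so the on-disk size can differ slightly.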

Data is sourced primarily from official releases (GitHub, Hugging Face, papers), then from benchmark leaderboards, then from third-party evaluators.

Model basics

Reasoning traces: Not supported
Thinking modes: Not supported
Context length: 4K tokens
Max output length: No data
Model type: Chat model
Modality (in / out): No data
Release date: 2024-01-11
Model file size: 290GB
MoE architecture: Yes
Total params / Active params: 144.6B / N/A
Knowledge cutoff: No data

Open source & experience

Code license: not listed
Weights license: DEEPSEEK LICENSE AGREEMENT (free for commercial use)
Live demo: None

API details

API speed: No data
No public API pricing yet.

Benchmark Results

No benchmark data to show.

Publisher

DeepSeek-AI
