DeepSeekMoE 145B Base
DeepSeekMoE 145B Base is an AI model published by DeepSeek-AI, released on 2024-01-11, for 基础大模型, with 1446.0B parameters, and 4K tokens context length, requiring about 288GB storage, under the DEEPSEEK LICENSE AGREEMENT license.
Data sourced primarily from official releases (GitHub, Hugging Face, papers), then benchmark leaderboards, then third-party evaluators. Learn about our data methodology
DeepSeekMoE是幻方量化旗下大模型企业DeepSeek开源的一个混合专家大模型,也是目前已知的中国第一个开源的MoE大模型。
DeepSeekMoE 145B Base是其1446亿参数的版本。
欢迎关注 DataLearner 官方微信,获得最新 AI 技术推送
