DataLearner logoDataLearnerAI
Latest AI Insights
Model Evaluations
Model Directory
Model Comparison
Resource Center
Tool Directory

加载中...

DataLearner logoDataLearner AI

A knowledge platform focused on LLM benchmarking, datasets, and practical instruction with continuously updated capability maps.

产品

  • Leaderboards
  • 模型对比
  • Datasets

资源

  • Tutorials
  • Editorial
  • Tool directory

关于

  • 关于我们
  • 隐私政策
  • 数据收集方法
  • 联系我们

© 2026 DataLearner AI. DataLearner curates industry data and case studies so researchers, enterprises, and developers can rely on trustworthy intelligence.

隐私政策服务条款
Page navigation
目录
Model catalogMoonlight-16B-A3B-Instruct
MO

Moonlight-16B-A3B-Instruct

Moonlight-16B-A3B-Instruct

Release date: 2025-02-23更新于: 2025-02-23 21:20:35706
Live demoGitHubHugging FaceCompare
Parameters
160.0亿
Context length
8K
Chinese support
Supported
Reasoning ability

Data sourced primarily from official releases (GitHub, Hugging Face, papers), then benchmark leaderboards, then third-party evaluators. Learn about our data methodology

Moonlight-16B-A3B-Instruct

Model basics

Reasoning traces
Not supported
Context length
8K tokens
Max output length
No data
Model type
聊天大模型
Release date
2025-02-23
Model file size
32GB
MoE architecture
No
Total params / Active params
160.0B / N/A
Knowledge cutoff
No data
Inference modes
No mode data
Moonlight-16B-A3B-Instruct

Open source & experience

Code license
MIT License
Weights license
MIT License- 免费商用授权
GitHub repo
https://github.com/MoonshotAI/Moonlight
Hugging Face
https://huggingface.co/moonshotai/Moonlight-16B-A3B-Instruct
Live demo
No live demo
Moonlight-16B-A3B-Instruct

Official resources

Paper
Muon is Scalable for LLM Training
DataLearnerAI blog
月之暗面开源了一个全新的160亿参数规模的MoE大语言模型Moonlight-16B:其训练算力仅需业界主流的一半
Moonlight-16B-A3B-Instruct

API details

API speed
No data
No public API pricing yet.
Moonlight-16B-A3B-Instruct

Benchmark Results

综合评估

3 evaluations
Benchmark / mode
Score
Rank/total
MMLUNormal
70
53 / 59
BBHNormal
65.20
12 / 18
MMLU ProNormal
42.40
107 / 112

数学推理

2 evaluations
Benchmark / mode
Score
Rank/total
GSM8KNormal
77.40
17 / 24
MATHNormal
45.30
36 / 41

编程与软件工程

2 evaluations
Benchmark / mode
Score
Rank/total
MBPPNormal
63.80
21 / 27
HumanEvalNormal
48.10
30 / 36
查看评测深度分析与其他模型对比
Moonlight-16B-A3B-Instruct

Publisher

Moonshot AI
Moonshot AI
View publisher details
Moonlight-16B-A3B-Instruct

Model Overview

月之暗面开源的一个160以参数的混合专家大模型,每次推理参数共30亿。效果超过同类型的大模型。


本版本是其指令优化后的版本。




关于Moonlight-16B-A3B模型的详细介绍参考DataLearnerAI的官方博客: https://www.datalearner.com/blog/1051740316091143 

DataLearner 官方微信

欢迎关注 DataLearner 官方微信,获得最新 AI 技术推送

DataLearner 官方微信二维码