Muse Spark

Reasoning model

Muse Spark by Meta Superintelligence Labs

Release date: 2026-04-08
Updated: 2026-04-09 13:19:26
Parameters: Not disclosed
Context length: 262K
Chinese support: Not supported

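Meta Muse Spark is the first model from Meta Superintelligence Labs, released in April 2026, and the first public result of Meta's full rebuild of its AI research and development organization after the Llama 4 setback. A team led by Chief AI Officer Alexandr Wang developed the model over nine months. It natively supports multimodal input and has a built-in multi-agent parallel reasoning mechanism. In benchmarks, Muse Spark stands out on medical question answering (HealthBench Hard 42.8%) and chart understanding (CharXiv Reasoning 86.4), while its overall reasoning and agentic coding ability still trail GPT-5.4 and Gemini 3.1 Pro. Meta positions it as the starting point of the Muse series, with larger follow-up models already in development.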

Data sourced primarily from official releases (GitHub, Hugging Face, papers), then benchmark leaderboards, then third-party evaluators.

Model basics

Reasoning traces: Supported
Thinking modes: Thinking Mode (default), Standard Mode
Context length: 262K tokens
Max output length: No data
Model type: Reasoning model
Modality (in / out): Text, Image, Audio, Video → Text
Release date: 2026-04-08
Model file size: No data
MoE architecture: No
Total params / Active params: No data / N/A
Knowledge cutoff: No data
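
The 262K-token context length listed above can be turned into a rough pre-flight check for long inputs. The sketch below is illustrative only and rests on several assumptions: that "262K" means 262,144 tokens, that text averages about four characters per token (a generic heuristic, not Muse Spark's actual tokenizer), and that some tokens are reserved for the reply, since the page gives no maximum output length. The names `estimate_tokens` and `fits_context` are hypothetical helpers, not part of any Meta API.

```python
import math

# Assumption: "262K" on this page is read as 2**18 tokens.
CONTEXT_WINDOW_TOKENS = 262_144
# Assumption: generic rule of thumb, not Muse Spark's real tokenizer.
CHARS_PER_TOKEN = 4


def estimate_tokens(text: str) -> int:
    """Crude token estimate from character count, rounded up."""
    return math.ceil(len(text) / CHARS_PER_TOKEN)


def fits_context(
    text: str,
    reserved_output_tokens: int = 8_192,  # hypothetical reply budget
    window: int = CONTEXT_WINDOW_TOKENS,
) -> bool:
    """Return True if the estimated prompt plus reply budget fits the window."""
    return estimate_tokens(text) + reserved_output_tokens <= window


if __name__ == "__main__":
    sample = "word " * 50_000  # 250,000 characters
    print(estimate_tokens(sample), fits_context(sample))
```

An estimate like this is only a coarse filter. A real check would need the model's own tokenizer, which is not publicly available according to this page.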
Open source & experience

Code license: Not open source
Weights license: Not open source
GitHub repo: Unavailable
Hugging Face: Unavailable
API details

API speed: 3/5
API pricing: No public API pricing yet.
Benchmark Results

Muse Spark's leading benchmark results are HLE in Deep mode (rank 4 of 159, score 58), GPQA Diamond (rank 23 of 179, score 89.50), and FrontierMath (rank 9 of 60, score 39). This page also consolidates core specs, context limits, and API details so the model can be evaluated on benchmark results and deployment constraints together.
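
Because the leaderboards cited here differ widely in size, raw ranks are hard to compare directly. A minimal sketch, assuming rank divided by leaderboard size is an acceptable measure of relative standing, normalizes a few of the placements reported on this page. The `Placement` class and `by_relative_rank` helper are hypothetical names introduced only for illustration; the figures are copied from this page's benchmark tables.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Placement:
    benchmark: str
    mode: str
    score: float
    rank: int
    total: int

    @property
    def top_fraction(self) -> float:
        """Rank as a share of the leaderboard size (lower is better)."""
        return self.rank / self.total


# Figures copied from the benchmark tables on this page.
PLACEMENTS = [
    Placement("HLE", "Deep", 58, 4, 159),
    Placement("GPQA Diamond", "Thinking Mode", 89.50, 23, 179),
    Placement("FrontierMath", "Thinking Mode", 39, 9, 60),
    Placement("GDPval-AA", "Thinking Mode + Tools", 1444, 5, 21),
    Placement("Terminal Bench 2.0", "Thinking Mode + Tools", 59, 24, 46),
]


def by_relative_rank(placements):
    """Sort placements from strongest to weakest relative standing."""
    return sorted(placements, key=lambda p: p.top_fraction)


if __name__ == "__main__":
    for p in by_relative_rank(PLACEMENTS):
        print(f"{p.benchmark:<20} {p.mode:<22} top {p.top_fraction:.1%}")
```

Normalizing this way ignores how strong the competing entries are, so it complements the raw scores rather than replacing them.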

General Knowledge

5 evaluations

Benchmark | Mode | Score | Rank / total
GPQA Diamond | Thinking Mode | 89.50 | 23 / 179
HLE | Thinking Mode | 42.80 | 42 / 159
HLE | Thinking Mode + Tools | 50.40 | 19 / 159
HLE | Deep | 58 | 4 / 159
ARC-AGI-2 | Thinking Mode | 42.50 | 25 / 59
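
The three HLE rows above show how much the score moves with the evaluation mode. A short sketch of that arithmetic follows, using only the scores reported in this section. The `gains_over_baseline` helper is a hypothetical name, and treating plain Thinking Mode as the baseline is an assumption made here for illustration.

```python
# HLE scores for Muse Spark by mode, copied from the table above.
HLE_SCORES = {
    "Thinking Mode": 42.80,
    "Thinking Mode + Tools": 50.40,
    "Deep": 58.0,
}


def gains_over_baseline(scores, baseline):
    """Return each mode's point gain over the baseline mode (2 decimals)."""
    base = scores[baseline]
    return {
        mode: round(score - base, 2)
        for mode, score in scores.items()
        if mode != baseline
    }


if __name__ == "__main__":
    gains = gains_over_baseline(HLE_SCORES, "Thinking Mode")
    for mode, gain in gains.items():
        print(f"{mode}: +{gain} points over Thinking Mode")
```

On these figures, tool use adds 7.6 points and Deep mode adds 15.2 points over plain Thinking Mode, which accounts for the jump from rank 42 to rank 4 on the same leaderboard.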

Coding and Software Engineering

1 evaluation

Benchmark | Mode | Score | Rank / total
SWE-bench Verified | Thinking Mode + Tools | 77.40 | 24 / 108

Math and Reasoning

3 evaluations

Benchmark | Mode | Score | Rank / total
FrontierMath | Thinking Mode | 39 | 9 / 60
No data | No data | 14.60 | 23 / 80
No data | No data | 14.60 | 23 / 80

Agent Level Benchmark

1 evaluation

Benchmark | Mode | Score | Rank / total
τ²-Bench - Telecom | Thinking Mode + Tools | 92 | 20 / 35

AI Agent - Tool Usage

1 evaluation

Benchmark | Mode | Score | Rank / total
Terminal Bench 2.0 | Thinking Mode + Tools | 59 | 24 / 46

Productivity Knowledge

1 evaluation

Benchmark | Mode | Score | Rank / total
GDPval-AA | Thinking Mode + Tools | 1444 | 5 / 21

Publisher

Facebook AI Research Lab