LL

Llama 4 Behemoth Instruct

Multimodal modelLlama 4

Llama-4-Behemoth-17B-128E-Instruct

Release date: 2025-04-05Updated: 2025-04-06 08:27:261,219
Parameters
2T
Context length
1000K
Chinese support
Supported
Reasoning ability

Llama-4-Behemoth-17B-128E-Instruct is an AI model published by Facebook AI研究实验室, released on 2025-04-05, for Multimodal model, with 2T parameters, and 1000K context length, requiring about 4000GB storage, under the Llama4 License license, with a 95.00 score on MATH-500.

Data sourced primarily from official releases (GitHub, Hugging Face, papers), then benchmark leaderboards, then third-party evaluators. Learn about our data methodology

Llama 4 Behemoth Instruct

Model basics

Reasoning traces
Not supported
Thinking modes
Thinking modes not supported
Context length
1000K tokens
Max output length
4K tokens
Model type
Multimodal model
Modality (in / out)
Text, Image, Audio, Video → Text
Release date
2025-04-05
Model file size
4000GB
MoE architecture
No
Total params / Active params
2T / N/A
Knowledge cutoff
No data
Llama 4 Behemoth Instruct

Open source & experience

Llama 4 Behemoth Instruct

Official resources

Llama 4 Behemoth Instruct

API details

API speed
3/5
No public API pricing yet.
Llama 4 Behemoth Instruct

Benchmark Results

Llama 4 Behemoth Instruct currently shows benchmark results led by MMLU Pro (49 / 126, score 82.20), GPQA Diamond (98 / 179, score 73.70), MATH-500 (25 / 44, score 95). This page also consolidates core specs, context limits, and API pricing so you can evaluate the model from benchmark results and deployment constraints together.

Thinking

General Knowledge

2 evaluations
Benchmark / mode
Score
Rank/total
82.20
49 / 126
73.70
98 / 179

Math and Reasoning

1 evaluations
Benchmark / mode
Score
Rank/total
95
25 / 44

Coding and Software Engineer

1 evaluations
Benchmark / mode
Score
Rank/total
49.40
92 / 120

Compare with other models

No curated comparisons for this model yet.

Want a custom combination? Open the compare tool

Llama 4 Behemoth Instruct

Publisher

Facebook AI研究实验室
View publisher details
Llama-4-Behemoth-17B-128E-Instruct

Model Overview

Llama-4-Behemoth-17B-128E-Instruct is an AI model published by Facebook AI研究实验室, released on 2025-04-05, for Multimodal model, with 2T parameters, and 1000K context length, requiring about 4000GB storage, under the Llama4 License license, with a 95.00 score on MATH-500.

DataLearner on WeChat

Follow DataLearner on WeChat for AI model updates and research notes.

DataLearner WeChat QR code