Llama 4 Maverick Benchmark Details

Llama 4 Maverick currently shows benchmark results led by MBPP (13 / 28, score 77.60), MMLU (38 / 65, score 85.50), MATH (30 / 42, score 61.20). 1 source link is attached for reference.

Benchmark Results

Llama 4 Maverick

Benchmark Results

Thinking
Tool usage

General Knowledge

4 evaluations
Benchmark / mode
Score
Rank/total
85.50
38 / 65
62.90
103 / 126
ARC-AGI
Thinking Enabled
4.40
63 / 65
ARC-AGI-2
Thinking Enabled
0
57 / 59

Coding and Software Engineer

1 evaluations
Benchmark / mode
Score
Rank/total
77.60
13 / 28

Math and Reasoning

2 evaluations
Benchmark / mode
Score
Rank/total
61.20
30 / 42
0.70
55 / 60

Claw-style Agent Evaluation

1 evaluations
Benchmark / mode
Score
Rank/total
Pinch Bench
Thinking EnabledTools
46.10
36 / 37

Sources