Mistral Large 3 Benchmark Details

Mistral Large 3 currently has results on three benchmarks: Claw Bench (score 78.60, rank 22 of 29), Pinch Bench (score 72.20, rank 28 of 37), and Simple Bench (score 20.40, rank 58 of 63).

Benchmark Results

Commonsense Reasoning

1 evaluation
Benchmark / mode                Score   Rank / total
Simple Bench (Standard Mode)    20.40   58 / 63

Claw-style Agent Evaluation

2 evaluations
Benchmark / mode                         Score   Rank / total
Claw Bench (Thinking Enabled, Tools)     78.60   22 / 29
Pinch Bench (Thinking Enabled, Tools)    72.20   28 / 37