Qwen2.5-3B Benchmark Details

Qwen2.5-3B's benchmark results are led, by relative rank, by GSM8K (score 79.10, rank 17 of 26), BBH (score 56.30, rank 16 of 20), and MBPP (score 57.10, rank 24 of 28).

Benchmark Results

General Knowledge

4 evaluations

Benchmark / mode            Score    Rank/total
(name missing in source)    65.60    63 / 65
BBH                         56.30    16 / 20
(name missing in source)    34.60    123 / 126
(name missing in source)    24.30    176 / 179

Math and Reasoning

2 evaluations

Benchmark / mode            Score    Rank/total
GSM8K                       79.10    17 / 26
(name missing in source)    42.60    37 / 42

Coding and Software Engineering

2 evaluations

Benchmark / mode            Score    Rank/total
MBPP                        57.10    24 / 28
(name missing in source)    42.10    34 / 39
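
Because each benchmark ranks a different number of models, the Rank/total values above are not directly comparable. One way to put them on a common scale is to compute the share of other listed models that Qwen2.5-3B outranks. The following is a minimal sketch, not the leaderboard's own method: it assumes rank 1 is the best position, the helper name `fraction_outranked` is hypothetical, and rows whose benchmark name is not given are labeled by category only.

```python
# Rows copied from the tables above: (label, score, rank, total).
# Labels ending in "(unnamed)" are rows whose benchmark name is not given.
results = [
    ("General Knowledge (unnamed)", 65.60, 63, 65),
    ("BBH", 56.30, 16, 20),
    ("General Knowledge (unnamed)", 34.60, 123, 126),
    ("General Knowledge (unnamed)", 24.30, 176, 179),
    ("GSM8K", 79.10, 17, 26),
    ("Math and Reasoning (unnamed)", 42.60, 37, 42),
    ("MBPP", 57.10, 24, 28),
    ("Coding (unnamed)", 42.10, 34, 39),
]


def fraction_outranked(rank: int, total: int) -> float:
    """Share of the other ranked models placed below this rank.

    Assumes rank 1 is best, so rank == total yields 0.0 and rank == 1 yields 1.0.
    """
    if total < 2 or not 1 <= rank <= total:
        raise ValueError("need total >= 2 and 1 <= rank <= total")
    return (total - rank) / (total - 1)


# Sort from strongest to weakest relative standing.
by_standing = sorted(
    results,
    key=lambda row: fraction_outranked(row[2], row[3]),
    reverse=True,
)

for label, score, rank, total in by_standing:
    share = fraction_outranked(rank, total)
    print(f"{label:<30} score={score:6.2f}  rank={rank}/{total}  outranks={share:.1%}")
```

Under these assumptions, the three strongest relative standings are GSM8K, BBH, and MBPP, in that order, which matches the summary at the top of the page.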