GPT-5-mini Benchmark Details

GPT-5-mini's best current placements are on FrontierMath (rank 18 / 60, score 19.30), FrontierMath - Tier 4 (rank 35 / 80, score 6.30), and LiveBench (rank 51 / 115, score 65.91).
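As a rough illustration of how these three headline placements compare, the sketch below divides each rank by the size of its leaderboard. The values are copied from the summary above; the `rank_fraction` helper and the idea of ordering by rank divided by total are assumptions made for illustration, not something the page states.

```python
# Headline results copied from the summary: (benchmark, rank, total, score).
results = [
    ("FrontierMath", 18, 60, 19.30),
    ("FrontierMath - Tier 4", 35, 80, 6.30),
    ("LiveBench", 51, 115, 65.91),
]


def rank_fraction(rank: int, total: int) -> float:
    """Return rank / total; a smaller value means a better relative placement."""
    return rank / total


# Order by relative placement (an assumed ordering, not confirmed by the source).
ordered = sorted(results, key=lambda r: rank_fraction(r[1], r[2]))

for name, rank, total, score in ordered:
    share = rank_fraction(rank, total)
    print(f"{name}: rank {rank} / {total} ({share:.1%} of the way down), score {score:.2f}")
```

Under this assumed measure the three benchmarks keep the order in which the summary lists them, although scores on different benchmarks are on different scales and should not be compared directly.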

Benchmark Results

General Knowledge

8 evaluations
Benchmark / mode | Score | Rank/total
(name missing) | 78 | 69 / 126
(name missing) | 69 | 119 / 179
(name missing) | 0 | 177 / 179
LiveBench, Standard Mode | 61 | 66 / 115
LiveBench, Thinking Level · Low | 53.07 | 85 / 115
LiveBench, Thinking Level · High | 65.91 | 51 / 115
(name missing) | 5 | 155 / 159
(name missing) | 0 | 159 / 159

Coding and Software Engineering

2 evaluations
Benchmark / mode | Score | Rank/total
CodeClash, Standard Mode, Tools | 1200 | 5 / 8
(name missing) | 55 | 84 / 120

Math and Reasoning

6 evaluations
Benchmark / mode | Score | Rank/total
(name missing) | 47 | 90 / 106
(name missing) | 47 | 90 / 106
FrontierMath (mode missing) | 19.30 | 18 / 60
(name missing) | 19 | 20 / 60
FrontierMath - Tier 4, Thinking Level · Medium | 4.20 | 40 / 80
FrontierMath - Tier 4, Thinking Level · High | 6.30 | 35 / 80

AI Agent - Tool Usage

1 evaluation
Benchmark / mode | Score | Rank/total

Claw-style Agent Evaluation

1 evaluation
Benchmark / mode | Score | Rank/total
Pinch Bench, Thinking Enabled, Tools | 80.30 | 23 / 37