GPT-4.1 Benchmark Details
GPT-4.1's leading benchmark results are MMLU (score 90.20, rank 9 of 65), GSM8K (score 95.90, rank 5 of 26), and DROP (score 89.20, rank 4 of 9).
Benchmark Results
Comprehensive Evaluation (4 evaluations)
Benchmark / mode | Score | Rank/total
Mathematical Reasoning (6 evaluations)
Benchmark / mode | Score | Rank/total
Coding and Software Engineering (4 evaluations)
Benchmark / mode | Score | Rank/total