GPT-4o Benchmark Details
GPT-4o currently shows benchmark results led by HumanEval (8 / 39, score 90), MMLU (15 / 65, score 88.70), BBH (5 / 20, score 91.70).
Benchmark Results
GPT-4o
Benchmark Results
综合评估
5 evaluationsBenchmark / mode
Score
Rank/total
编程与软件工程
4 evaluationsBenchmark / mode
Score
Rank/total
数学推理
5 evaluationsBenchmark / mode
Score
Rank/total