GPT-4o Benchmark Details
GPT-4o's leading benchmark results are HumanEval (8 / 39, score 90), MMLU (14 / 64, score 88.70), and BBH (5 / 20, score 91.70).
Benchmark Results
GPT-4o
HumanEval: 8 / 39, score 90
MMLU: 14 / 64, score 88.70
BBH: 5 / 20, score 91.70