o3-pro Benchmark Details
o3-pro's strongest benchmark results are Aider-Polyglot (rank 1 of 26, score 84.90), AIME 2024 (rank 8 of 62, score 93), and SWE-bench Verified (rank 21 of 93, score 75).
Benchmark Results
o3-pro
Aider-Polyglot: rank 1 of 26, score 84.90
AIME 2024: rank 8 of 62, score 93
SWE-bench Verified: rank 21 of 93, score 75