GPT-5.1-Codex-Max Benchmark Details
GPT-5.1-Codex-Max currently shows benchmark results led by Terminal-Bench (1 / 35, score 58.10), SWE-bench Verified (16 / 95, score 76.80).
Benchmark Results
GPT-5.1-Codex-Max
GPT-5.1-Codex-Max currently shows benchmark results led by Terminal-Bench (1 / 35, score 58.10), SWE-bench Verified (16 / 95, score 76.80).