GPT-5.1-Codex-Max Benchmark Details
GPT-5.1-Codex-Max currently shows benchmark results led by Terminal-Bench (1 / 35, score 58.10), SWE-bench Verified (24 / 105, score 76.80).
Benchmark Results
GPT-5.1-Codex-Max
GPT-5.1-Codex-Max currently shows benchmark results led by Terminal-Bench (1 / 35, score 58.10), SWE-bench Verified (24 / 105, score 76.80).