GPT-5.1 Codex Benchmark Details
GPT-5.1 Codex's current benchmark results are led by Terminal-Bench (rank 2 of 35, score 56.30), LiveCodeBench (rank 15 of 120, score 85.50), and SWE-bench Verified (rank 54 of 108, score 70.40).
Benchmark Results
Coding and Software Engineering
2 evaluations
Benchmark / mode
Score
Rank/total