GPT-5.1 Codex Benchmark Details
GPT-5.1 Codex's current benchmark results are led by Terminal-Bench (rank 2 of 35, score 56.30), LiveCodeBench (rank 8 of 109, score 85.50), and SWE-bench Verified (rank 41 of 95, score 70.40).
Benchmark Results
Programming and Software Engineering
2 evaluations
Benchmark / mode
Score
Rank/total