Opus 4.5 Benchmark Details
Opus 4.5 currently shows benchmark results led by MMLU Pro (2 / 126, score 90), SWE-bench Verified (5 / 105, score 80.90), Terminal Bench Hard (1 / 13, score 44). 3 source links are attached for reference.
Benchmark Results
Opus 4.5
Benchmark Results
综合评估
8 evaluationsBenchmark / mode
Score
Rank/total
编程与软件工程
2 evaluationsBenchmark / mode
Score
Rank/total
Agent能力评测
3 evaluationsBenchmark / mode
Score
Rank/total
数学推理
6 evaluationsBenchmark / mode
Score
Rank/total
OpenClaw智能体能力综合测评
2 evaluationsBenchmark / mode
Score
Rank/total