Opus 4.1 Benchmark Details
Opus 4.1 currently shows benchmark results led by MMLU Pro (7 / 126, score 88), LiveBench (7 / 52, score 75.25), Terminal-Bench (5 / 35, score 46.50). 1 source link is attached for reference.
Benchmark Results
Opus 4.1
Benchmark Results
综合评估
4 evaluationsBenchmark / mode
Score
Rank/total
数学推理
7 evaluationsBenchmark / mode
Score
Rank/total
AI Agent - 工具使用
2 evaluationsBenchmark / mode
Score
Rank/total