Opus 4.7 vs Claude Opus 4.6
Across 12 shared benchmarks, Opus 4.7 leads overall: it wins 9, Claude Opus 4.6 wins 1, 2 are ties, and the average score difference is +2.54 in Opus 4.7's favor.
Opus 4.7 · Anthropic · 2026-04-16 · Reasoning model
Claude Opus 4.6 · Anthropic · 2026-02-05 · Reasoning model
Benchmarks are grouped by capability and sorted by largest gap within each group; 12 shared benchmarks in total. Each cell shows the score, the model's leaderboard rank, and the evaluation mode.
| Benchmark | Opus 4.7 | Claude Opus 4.6 | Diff |
|---|---|---|---|
| ARC-AGI-2 | 75.80 · #9/58 · Highest (no tools) | 66.30 · #14/58 · Extended (no tools) | +9.50 |
| GPQA Diamond | 94.20 · #4/175 · Extended (no tools) | 91.31 · #12/175 · Extended (no tools) | +2.89 |
| HLE | 54.70 · #6/149 · Extended (with tools) | 53.00 · #8/149 · Extended (with tools, internet) | +1.70 |
| ARC-AGI | 93.50 · #9/65 · Thinking High (no tools) | 92.00 · #11/65 · Extended (no tools) | +1.50 |
| MMLU | 91.50 · #6/65 · Normal (no tools) | 91.05 · #7/65 · Extended (no tools) | +0.45 |
| ARC-AGI-3 | 0.00 · #5/6 · Thinking High (no tools) | 0.00 · #1/6 · Highest (no tools) | — |
| Benchmark | Opus 4.7 | Claude Opus 4.6 | Diff |
|---|---|---|---|
| OSWorld-Verified | 78.00 · #3/14 · Extended (with tools) | 72.70 · #6/14 · Extended (with tools) | +5.30 |
| Terminal Bench 2.0 | 69.40 · #5/43 · Extended (with tools) | 65.40 · #9/43 · Extended (with tools) | +4.00 |
| Benchmark | Opus 4.7 | Claude Opus 4.6 | Diff |
|---|---|---|---|
| FrontierMath | 43.80 · #6/60 · Very high thinking (no tools) | 40.70 · #7/60 · Highest (no tools) | +3.10 |
| FrontierMath - Tier 4 | 22.90 · #12/80 · Very high thinking (no tools) | 22.90 · #12/80 · Highest (no tools) | — |
| Benchmark | Opus 4.7 | Claude Opus 4.6 | Diff |
|---|---|---|---|
| BrowseComp | 79.30 · #11/43 · Extended (with tools) | 84.00 · #6/43 · Thinking (with tools + internet) | -4.70 |
| Benchmark | Opus 4.7 | Claude Opus 4.6 | Diff |
|---|---|---|---|
| SWE-bench Verified | 87.60 · #2/103 · Extended (with tools) | 80.84 · #6/103 · Extended (with tools) | +6.76 |
| Field | Opus 4.7 | Claude Opus 4.6 |
|---|---|---|
| Publisher | Anthropic | Anthropic |
| Release date | 2026-04-16 | 2026-02-05 |
| Model type | Reasoning model | Reasoning model |
| Architecture | Dense | Dense |
| Parameters | Not disclosed | Not disclosed |
| Context length | 1,000K tokens | 1,000K tokens |
| Max output | 131,072 tokens | 65,536 tokens |
Prices use DataLearner records when available; missing fields are not inferred.
| Item | Opus 4.7 | Claude Opus 4.6 |
|---|---|---|
| Text input | $5 / 1M tokens | $0.5 / 1M tokens |
| Text output | $25 / 1M tokens | $25 / 1M tokens |
| Cache read | $0.5 / 1M tokens | $0.5 / 1M tokens |
| Cache write | $6.25 / 1M tokens | $10 / 1M tokens |
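As a worked example of reading the pricing table, the sketch below estimates the cost of a single uncached request; the prices come from the table above, while the request sizes (10K input / 2K output tokens) are hypothetical and chosen only for illustration.

```python
# USD per 1M tokens, copied from the pricing table above.
PRICES = {
    "Opus 4.7":        {"input": 5.00, "output": 25.00},
    "Claude Opus 4.6": {"input": 0.50, "output": 25.00},
}

def request_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Cost in USD for one uncached request (no cache reads/writes)."""
    p = PRICES[model]
    return (input_tokens * p["input"] + output_tokens * p["output"]) / 1_000_000

# A hypothetical 10K-in / 2K-out request:
print(request_cost("Opus 4.7", 10_000, 2_000))         # 0.05 input + 0.05 output
print(request_cost("Claude Opus 4.6", 10_000, 2_000))  # 0.005 input + 0.05 output
```

Cache-read and cache-write rates from the table would add further terms for requests that use prompt caching.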
On average across the 12 shared benchmarks, Opus 4.7 scores 2.54 points higher.
Largest single-benchmark gap: ARC-AGI-2 — Opus 4.7 75.80 vs Claude Opus 4.6 66.30 (+9.50).
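The headline numbers (9 wins, 1 loss, 2 ties, +2.54 average) can be reproduced from the per-benchmark Diff column above:

```python
# Score differences (Opus 4.7 minus Claude Opus 4.6), copied from the
# Diff column of the benchmark tables above; ties are recorded as 0.0.
diffs = [9.50, 2.89, 1.70, 1.50, 0.45, 0.0,  # ARC-AGI-2 ... ARC-AGI-3
         5.30, 4.00,                          # OSWorld-Verified, Terminal Bench 2.0
         3.10, 0.0,                           # FrontierMath, FrontierMath - Tier 4
         -4.70,                               # BrowseComp
         6.76]                                # SWE-bench Verified

wins   = sum(d > 0 for d in diffs)
losses = sum(d < 0 for d in diffs)
ties   = sum(d == 0 for d in diffs)
avg    = sum(diffs) / len(diffs)

print(wins, losses, ties, round(avg, 2))  # 9 1 2 2.54
```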
Page generated from structured model, pricing and benchmark records. No real-time LLM is used to write the prose.