Claude Opus 4

Name: Claude Opus 4
Author: Anthropic

Reasoning modelOpusClaude 4

Claude Opus 4

Release date: 2025-05-23Updated: 2025-05-25 09:48:391,706

Live demoGitHubHugging FaceCompare

Parameters

Not disclosed

Context length

200K

Chinese support

Supported

Reasoning ability

Claude Opus 4 is an AI model published by Anthropic, released on 2025-05-23, for Reasoning model, and 200K context length, under the 不开源 license, with a 98.20 score on MATH-500.

Data sourced primarily from official releases (GitHub, Hugging Face, papers), then benchmark leaderboards, then third-party evaluators. Learn about our data methodology

Claude Opus 4

Model basics

Reasoning traces

Supported

Thinking modes

Thinking modes not supported

Context length

200K tokens

Max output length

32K tokens

Model type

Reasoning model

Modality (in / out)

Text, Image → Text

Release date

2025-05-23

Model file size

No data

MoE architecture

Total params / Active params

No data / N/A

Knowledge cutoff

No data

Claude Opus 4

Open source & experience

Code license

不开源

Weights license

不开源

GitHub repo

GitHub link unavailable

Hugging Face

Hugging Face link unavailable

Live demo

https://claude.ai/new

Claude Opus 4

Official resources

Paper

Introducing Claude 4

DataLearnerAI blog

Claude Opus 4

API details

API speed

3/5

No public API pricing yet.

Claude Opus 4

Benchmark Results

Claude Opus 4 currently shows benchmark results led by MATH-500 (3 / 44, score 98.20), MMLU Pro (25 / 126, score 85), Aider-Polyglot (13 / 59, score 72). This page also consolidates core specs, context limits, and API pricing so you can evaluate the model from benchmark results and deployment constraints together.

General Knowledge

5 evaluations

Benchmark / mode

Score

Rank/total

MMLU Pro

25 / 126

GPQA Diamond

79.60

80 / 179

ARC-AGI

35.70

48 / 65

HLE

10.70

131 / 159

ARC-AGI-2

8.60

39 / 59

Coding and Software Engineer

2 evaluations

Benchmark / mode

Score

Rank/total

SWE-bench Verified

72.50

48 / 108

LiveCodeBench

56.60

76 / 120

Math and Reasoning

9 evaluations

Benchmark / mode

Score

Rank/total

MATH-500

98.20

3 / 44

AIME 2024

35 / 62

AIME2025

75.50

65 / 106

FrontierMath

4.50

39 / 60

FrontierMath

4.10

41 / 60

FrontierMath - Tier 4

Standard Mode

72 / 80

FrontierMath - Tier 4

4.20

40 / 80

FrontierMath - Tier 4

32K

4.20

40 / 80

IMO-ProofBench

2.90

16 / 16

Writing and Creative Capabilities

1 evaluations

Benchmark / mode

Score

Rank/total

Creative Writing

83.75

13 / 23

常识推理

1 evaluations

Benchmark / mode

Score

Rank/total

Simple Bench

Thinking Mode

58.80

17 / 63

Agent Level Benchmark

3 evaluations

Benchmark / mode

Score

Rank/total

τ²-Bench

72.50

22 / 40

Aider-Polyglot

Standard Mode

70.70

16 / 59

Aider-Polyglot

32K

13 / 59

View benchmark analysis Compare with other models

Compare with other models

No curated comparisons for this model yet.

Want a custom combination? Open the compare tool

Claude Opus 4

Publisher

Anthropic

View publisher details

Claude Opus 4

Model Overview

Claude Opus 4 is an AI model published by Anthropic, released on 2025-05-23, for Reasoning model, and 200K context length, under the 不开源 license, with a 98.20 score on MATH-500.

DataLearner on WeChat

Follow DataLearner on WeChat for AI model updates and research notes.