DataLearner logoDataLearnerAI
Latest AI Insights
Model Leaderboards
Benchmarks
Model Directory
Model Comparison
Resource Center
Tools
LanguageEnglish
DataLearner logoDataLearner AI

A knowledge platform focused on LLM benchmarking, datasets, and practical instruction with continuously updated capability maps.

Products

  • Leaderboards
  • Model comparison
  • Datasets

Resources

  • Tutorials
  • Editorial
  • Tool directory

Company

  • About
  • Privacy policy
  • Data methodology
  • Contact

© 2026 DataLearner AI. DataLearner curates industry data and case studies so researchers, enterprises, and developers can rely on trustworthy intelligence.

Privacy policyTerms of service
Page navigation
Page navigation
Model catalogClaude Opus 4
CL

Claude Opus 4

Reasoning model

Claude Opus 4

Release date: 2025-05-23Updated: 2025-05-25 09:48:391,578
Live demoGitHubHugging FaceCompare
Parameters
Not disclosed
Context length
200K
Chinese support
Supported
Reasoning ability

Claude Opus 4 is an AI model published by Anthropic, released on 2025-05-23, for Reasoning model, and 200K tokens context length, with no open-source license.

Data sourced primarily from official releases (GitHub, Hugging Face, papers), then benchmark leaderboards, then third-party evaluators. Learn about our data methodology

Claude Opus 4

Model basics

Reasoning traces
Supported
Thinking modes
Thinking modes not supported
Context length
200K tokens
Max output length
32000 tokens
Model type
Reasoning model
Release date
2025-05-23
Model file size
No data
MoE architecture
No
Total params / Active params
No data / N/A
Knowledge cutoff
No data
Claude Opus 4

Open source & experience

Code license
Not open source
Weights license
Not open source
GitHub repo
GitHub link unavailable
Hugging Face
Hugging Face link unavailable
Live demo
https://claude.ai/new
Claude Opus 4

Official resources

Paper
Introducing Claude 4
DataLearnerAI blog
DataLearnerAI blog
Claude Opus 4

API details

API speed
3/5
💡Default unit: $/1M tokens. If vendors use other units, follow their published pricing.
Standard pricingStandard
ModalityInputOutput
Text$15$75
Image$15--
Claude Opus 4

Benchmark Results

Claude Opus 4 currently shows benchmark results led by MATH-500 (3 / 44, score 98.20), MMLU Pro (23 / 124, score 85), Simple Bench (7 / 27, score 58.80). This page also consolidates core specs, context limits, and API pricing so you can evaluate the model from benchmark results and deployment constraints together.

Thinking

General Knowledge

5 evaluations
Benchmark / mode
Score
Rank/total
MMLU Pro
Standard Mode
85
23 / 124
GPQA Diamond
Standard Mode
79.60
76 / 175
ARC-AGI
Standard Mode
35.70
48 / 65
HLE
Standard Mode
10.70
121 / 149
ARC-AGI-2
Standard Mode
8.60
38 / 58

Coding and Software Engineer

2 evaluations
Benchmark / mode
Score
Rank/total
SWE-bench Verified
Standard Mode
72.50
43 / 103
LiveCodeBench
Standard Mode
56.60
74 / 118

Math and Reasoning

9 evaluations
Benchmark / mode
Score
Rank/total
MATH-500
Standard Mode
98.20
3 / 44
AIME 2024
Standard Mode
76
35 / 62
AIME2025
Standard Mode
75.50
65 / 106
FrontierMath
Standard Mode
4.50
39 / 60
FrontierMath
Thinking Mode
4.10
41 / 60
FrontierMath - Tier 4
Standard Mode
0
72 / 80
FrontierMath - Tier 4
Thinking Mode
4.20
40 / 80
FrontierMath - Tier 4
32K
4.20
40 / 80
IMO-ProofBench
Thinking Mode
2.90
16 / 16

Writing and Creative Capabilities

1 evaluations
Benchmark / mode
Score
Rank/total
Creative Writing
Standard Mode
83.75
13 / 23

Math and Reasoning

1 evaluations
Benchmark / mode
Score
Rank/total
Simple Bench
Thinking Mode
58.80
7 / 27

Agent Level Benchmark

3 evaluations
Benchmark / mode
Score
Rank/total
τ²-Bench
72.50
22 / 40
Aider-Polyglot
Standard Mode
70.10
11 / 26
Aider-Polyglot
Thinking Mode
72
8 / 26
View benchmark analysisCompare with other models

Compare with other models

No curated comparisons for this model yet.

Want a custom combination? Open the compare tool

Claude Opus 4

Publisher

Anthropic
Anthropic
View publisher details
Claude Opus 4

Model Overview

Claude Opus 4 is an AI model published by Anthropic, released on 2025-05-23, for Reasoning model, and 200K tokens context length, with no open-source license.

DataLearner on WeChat

Follow DataLearner on WeChat for AI model updates and research notes.

DataLearner WeChat QR code