DataLearner logoDataLearnerAI
Latest AI Insights
Model Leaderboards
Benchmarks
Model Directory
Model Comparison
Resource Center
Tools
LanguageEnglish
DataLearner logoDataLearner AI

A knowledge platform focused on LLM benchmarking, datasets, and practical instruction with continuously updated capability maps.

Products

  • Leaderboards
  • Model comparison
  • Datasets

Resources

  • Tutorials
  • Editorial
  • Tool directory

Company

  • About
  • Privacy policy
  • Data methodology
  • Contact

© 2026 DataLearner AI. DataLearner curates industry data and case studies so researchers, enterprises, and developers can rely on trustworthy intelligence.

Privacy policyTerms of service
Page navigation
Page navigation
Model catalogOpus 4.5
OP

Opus 4.5

Reasoning model

Claude Opus 4.5

Release date: 2025-11-25Updated: 2026-04-19 11:39:19Knowledge cutoff: 2025-032,243
Live demoGitHubHugging FaceCompare
Parameters
Not disclosed
Context length
200K
Chinese support
Supported
Reasoning ability

Claude Opus 4.5 is an AI model published by Anthropic, released on 2025-11-25, for Reasoning model, and 200K tokens context length, with no open-source license.

Data sourced primarily from official releases (GitHub, Hugging Face, papers), then benchmark leaderboards, then third-party evaluators. Learn about our data methodology

Opus 4.5

Model basics

Reasoning traces
Supported
Thinking modes
Thinking Level · Extended (Default)Standard Mode
Context length
200K tokens
Max output length
65536 tokens
Model type
Reasoning model
Release date
2025-11-25
Model file size
No data
MoE architecture
No
Total params / Active params
No data / N/A
Knowledge cutoff
2025-03
Opus 4.5

Open source & experience

Code license
Not open source
Weights license
Not open source
GitHub repo
GitHub link unavailable
Hugging Face
Hugging Face link unavailable
Live demo
https://claude.ai/
Opus 4.5

Official resources

Paper
Introducing Claude Opus 4.5
DataLearnerAI blog
No blog post yet
Opus 4.5

API details

API speed
3/5
💡Default unit: $/1M tokens. If vendors use other units, follow their published pricing.
Learn about pricing modes
Standard
TypeConditionInputOutput
Text-$5.00/ 1M$25.00/ 1M
Batch
TypeConditionInputOutput
Text-$2.50/ 1M$12.50/ 1M
Cache PricingPrompt Cache
TypeTTLWriteRead
Text5m$6.25/ 1M$0.500/ 1M
Opus 4.5

Benchmark Results

Opus 4.5 currently shows benchmark results led by MMLU Pro (2 / 126, score 90), SWE-bench Verified (5 / 105, score 80.90), Terminal Bench Hard (1 / 13, score 44). This page also consolidates core specs, context limits, and API pricing so you can evaluate the model from benchmark results and deployment constraints together.

Thinking
Tool usage

General Knowledge

8 evaluations
Benchmark / mode
Score
Rank/total
MMLU Pro
Extended
90
2 / 126
GPQA Diamond
Extended
87
37 / 177
ARC-AGI
Extended
80
21 / 65
LiveBench
Medium
74.87
9 / 52
LiveBench
High
75.58
6 / 52
HLE
Extended
30.80
67 / 154
HLE
ExtendedTools
43.20
36 / 154
ARC-AGI-2
Extended
37.60
26 / 59

Coding and Software Engineer

2 evaluations
Benchmark / mode
Score
Rank/total
LiveCodeBench
ExtendedTools
87
12 / 120
SWE-bench Verified
ExtendedTools
80.90
5 / 105

Multimodal Understanding

1 evaluations
Benchmark / mode
Score
Rank/total
MMMU
Extended
80.70
10 / 28

Math and Reasoning

1 evaluations
Benchmark / mode
Score
Rank/total
Simple Bench
Extended
62
3 / 27

Agent Level Benchmark

3 evaluations
Benchmark / mode
Score
Rank/total
τ²-Bench - Telecom
ExtendedTools
90.70
21 / 35
τ²-Bench
ExtendedTools
81.99
13 / 40
Terminal Bench Hard
ExtendedTools
44
1 / 13

Math and Reasoning

6 evaluations
Benchmark / mode
Score
Rank/total
AIME 2026
Extended
93.30
5 / 14
FrontierMath
Extended
20.70
17 / 60
FrontierMath - Tier 4
Standard Mode
4.20
40 / 80
FrontierMath - Tier 4
16K
2.10
56 / 80
FrontierMath - Tier 4
32K
4.20
40 / 80
FrontierMath - Tier 4
Extended
4.20
40 / 80

Instruction Following

1 evaluations
Benchmark / mode
Score
Rank/total
IF Bench
ExtendedTools
58
20 / 29

AI Agent - Tool Usage

1 evaluations
Benchmark / mode
Score
Rank/total
Terminal Bench 2.0
ExtendedTools
59.30
20 / 46

Claw-style Agent Evaluation

2 evaluations
Benchmark / mode
Score
Rank/total
Claw Bench
ExtendedTools
91.50
7 / 29
Pinch Bench
ExtendedTools
87.20
8 / 37
View benchmark analysisCompare with other models
Opus 4.5

Publisher

Anthropic
Anthropic
View publisher details
Claude Opus 4.5

Model Overview

Claude Opus 4.5 is an AI model published by Anthropic, released on 2025-11-25, for Reasoning model, and 200K tokens context length, with no open-source license.

DataLearner on WeChat

Follow DataLearner on WeChat for AI model updates and research notes.

DataLearner WeChat QR code

Compare with other models

No curated comparisons for this model yet.

Want a custom combination? Open the compare tool