DataLearner logoDataLearnerAI
Latest AI Insights
Model Evaluations
Model Directory
Model Comparison
Resource Center
Tool Directory

加载中...

DataLearner logoDataLearner AI

A knowledge platform focused on LLM benchmarking, datasets, and practical instruction with continuously updated capability maps.

产品

  • Leaderboards
  • 模型对比
  • Datasets

资源

  • Tutorials
  • Editorial
  • Tool directory

关于

  • 关于我们
  • 隐私政策
  • 数据收集方法
  • 联系我们

© 2026 DataLearner AI. DataLearner curates industry data and case studies so researchers, enterprises, and developers can rely on trustworthy intelligence.

隐私政策服务条款
Page navigation
目录
Model catalogClaude3-Opus
CL

Claude3-Opus

Claude3-Opus

Release date: 2024-03-04更新于: 2024-04-18 08:32:01788
Live demoGitHubHugging FaceCompare
Parameters
Not disclosed
Context length
200K
Chinese support
Supported
Reasoning ability

Data sourced primarily from official releases (GitHub, Hugging Face, papers), then benchmark leaderboards, then third-party evaluators. Learn about our data methodology

Claude3-Opus

Model basics

Reasoning traces
Not supported
Context length
200K tokens
Max output length
No data
Model type
多模态大模型
Release date
2024-03-04
Model file size
No data
MoE architecture
No
Total params / Active params
0.0B / N/A
Knowledge cutoff
No data
Inference modes
No mode data
Claude3-Opus

Open source & experience

Code license
不开源
Weights license
不开源- 不开源
GitHub repo
GitHub link unavailable
Hugging Face
Hugging Face link unavailable
Live demo
No live demo
Claude3-Opus

Official resources

Paper
Introducing the next generation of Claude
DataLearnerAI blog
评测结果超过GPT-4,Anthropic发布第三代大语言模型Claude3,具有多模态能力,实际评测表现优秀!
Claude3-Opus

API details

API speed
No data
No public API pricing yet.
Claude3-Opus

Benchmark Results

综合评估

3 evaluations
Benchmark / mode
Score
Rank/total
MMLUNormal
86.80
23 / 59
MMLU ProNormal
68.45
81 / 112
GPQA DiamondNormal
50.40
127 / 153

数学推理

2 evaluations
Benchmark / mode
Score
Rank/total
GSM8KNormal
95
7 / 24
MATHNormal
60.10
31 / 41

编程与软件工程

1 evaluations
Benchmark / mode
Score
Rank/total
HumanEvalNormal
84.90
20 / 36

常识推理

1 evaluations
Benchmark / mode
Score
Rank/total
HellaSwagNormal
95.40
1 / 1

阅读理解

1 evaluations
Benchmark / mode
Score
Rank/total
DROPNormal
83.10
6 / 6
查看评测深度分析与其他模型对比
Claude3-Opus

Publisher

Anthropic
Anthropic
View publisher details
Claude3-Opus

Model Overview

Claude3-Opus是Anthropic公司发布的第三代多模态大语言模型。第三代的Claude-3模型包含3个版本,这里说的Claude3-Opus是其中能力最强的模型。各项评测人任务结果都非常好,甚至超过了GPT-4。


在多模态方面,Claude3-Opus也有强大的能力。


Claude2最受诟病的就是无效的拒绝回答。由于Anthropic在对齐方面做了严格的工作,导致Claude2.1经常出现拒绝回答的情况。在Claude3-Opus上。Anthropic做了改进,在内部测试中,Claude2.1错误地拒绝比例大概在26%左右,而Claude3-Opus上这个比例下降到了11%,进步明显!

DataLearner 官方微信

欢迎关注 DataLearner 官方微信,获得最新 AI 技术推送

DataLearner 官方微信二维码