DataLearner AI

A knowledge platform focused on LLM benchmarking, datasets, and practical instruction with continuously updated capability maps.

© 2026 DataLearner AI. DataLearner curates industry data and case studies so researchers, enterprises, and developers can rely on trustworthy intelligence.

LLM Coding Ability Evaluation Leaderboard

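This page presents evaluation results for current mainstream large language models on coding ability, covering benchmark datasets such as HumanEval and MBPP.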

Top Model: Llama3.3-70B-Instruct
Top Score: -
Model Count: 14
Data version: -
Data source: evaluation results reported in papers or on GitHub

Ranking Table

Model                  Parameters (×100M)  HumanEval Pass@1  MBPP Pass@1  Organization          License
Llama3.3-70B-Instruct  700.0               88.40             87.60        Facebook AI Research  /
Qwen2-72B-Instruct     720.0               86                80.20        Alibaba               /
Llama3-70B             700.0               81.70             /            Facebook AI Research  /
Llama3-70B-Instruct    700.0               81.70             /            Facebook AI Research  /
Llama3.1-70B-Instruct  700.0               80.50             86           Facebook AI Research  /
Gemini-pro             1000.0              67.70             /            DeepMind              /
Qwen2-72B              727.0               64.60             76.90        Alibaba               /
Qwen2.5-72B            727.0               59.10             84.70        Alibaba               /
Qwen2-57B-A14B         570.0               53                71.90        Alibaba               /
Qwen1.5-72B-Chat       720.0               41.50             53.40        Alibaba               /
Mixtral-8×7B-MoE       450.0               40.20             60.70        MistralAI             /
Qwen-72B               720.0               35.40             52.20        Alibaba               /
LLaMA2 70B             700.0               30.50             45.40        Facebook AI Research  /
XVERSE-65B             650.0               26.80             /            XVERSE                /

A "/" marks a value that is not listed. Data is for reference only. Official sources are authoritative.
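
The HumanEval and MBPP columns report Pass@1. As a rough illustration of what such a score means, here is a minimal Python sketch of the commonly used unbiased pass@k estimator (popularized alongside HumanEval), where n samples are generated per problem and c of them pass the unit tests. This page does not state how each listed score was produced, so the sampling setup is an assumption, and the function names `pass_at_k` and `benchmark_score` are hypothetical.

```python
from math import comb


def pass_at_k(n: int, c: int, k: int) -> float:
    """Estimate pass@k for one problem from n samples, c of which pass.

    Uses 1 - C(n - c, k) / C(n, k): the probability that at least one of
    k samples drawn without replacement from the n is a passing one.
    """
    if not (0 <= c <= n) or not (1 <= k <= n):
        raise ValueError("require 0 <= c <= n and 1 <= k <= n")
    # comb(n - c, k) is 0 when fewer than k samples fail, giving 1.0.
    return 1.0 - comb(n - c, k) / comb(n, k)


def benchmark_score(results: list[tuple[int, int]], k: int = 1) -> float:
    """Average pass@k over all problems, expressed as a percentage.

    `results` holds one (n, c) pair per benchmark problem (assumed layout).
    """
    if not results:
        raise ValueError("results must not be empty")
    return 100.0 * sum(pass_at_k(n, c, k) for n, c in results) / len(results)
```

For k = 1 the estimator reduces to c / n per problem, so with a single sample per problem the benchmark score is simply the percentage of problems solved on the first attempt.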