A knowledge platform focused on LLM benchmarking, datasets, and practical instruction with continuously updated capability maps.

© 2026 DataLearner AI. DataLearner curates industry data and case studies so researchers, enterprises, and developers can rely on trustworthy intelligence.


LLM Coding Ability Leaderboard

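This page presents evaluation results for current mainstream large language models on coding ability, measured on benchmark datasets including HumanEval and MBPP.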

Data source: evaluation results reported in papers or on GitHub
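Both score columns on this page are Pass@1 values. For readers unfamiliar with the metric, the following is a minimal sketch of the unbiased pass@k estimator commonly used with HumanEval, assuming n samples are generated per problem and c of them pass the unit tests. The function names `pass_at_k` and `benchmark_pass_at_k` are illustrative and not taken from this page, and whether each listed score was computed this way depends on the original paper or repository.

```python
from math import comb


def pass_at_k(n: int, c: int, k: int) -> float:
    """Unbiased pass@k estimate for one problem: 1 - C(n - c, k) / C(n, k).

    n: samples generated, c: samples that passed the tests, k: budget.
    """
    if not 0 <= c <= n or not 1 <= k <= n:
        raise ValueError("require 0 <= c <= n and 1 <= k <= n")
    if n - c < k:
        # Fewer than k failing samples: every draw of k contains a pass.
        return 1.0
    return 1.0 - comb(n - c, k) / comb(n, k)


def benchmark_pass_at_k(results, k: int) -> float:
    """Average pass@k over problems, as a percentage.

    results: list of (n, c) pairs, one pair per benchmark problem.
    """
    return 100.0 * sum(pass_at_k(n, c, k) for n, c in results) / len(results)
```

With a single sample per problem (n = 1, k = 1), the estimate reduces to the plain fraction of problems solved, which is how a Pass@1 score is usually read.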

Model | Parameters (×100M) | HumanEval Pass@1 | MBPP Pass@1 | Organization | License
Llama3.3-70B-Instruct | 700.0 | 88.40 | 87.60 | Facebook AI Research Lab | /
Qwen2-72B-Instruct | 720.0 | 86 | 80.20 | Alibaba | /
Llama3-70B | 700.0 | 81.70 | / | Facebook AI Research Lab | /
Llama3-70B-Instruct | 700.0 | 81.70 | / | Facebook AI Research Lab | /
Llama3.1-70B-Instruct | 700.0 | 80.50 | 86 | Facebook AI Research Lab | /
Gemini-pro | 1000.0 | 67.70 | / | DeepMind | /
Qwen2-72B | 727.0 | 64.60 | 76.90 | Alibaba | /
Qwen2.5-72B | 727.0 | 59.10 | 84.70 | Alibaba | /
Qwen2-57B-A14B | 570.0 | 53 | 71.90 | Alibaba | /
Qwen1.5-72B-Chat | 720.0 | 41.50 | 53.40 | Alibaba | /
Mixtral-8×7B-MoE | 450.0 | 40.20 | 60.70 | MistralAI | /
Qwen-72B | 720.0 | 35.40 | 52.20 | Alibaba | /
LLaMA2 70B | 700.0 | 30.50 | 45.40 | Facebook AI Research Lab | /
XVERSE-65B | 650.0 | 26.80 | / | XVERSE (元象) | /

A "/" marks a value that is not listed. Data is for reference only. Official sources are authoritative.
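To slice these results by model size offline, a short sketch such as the one below may help. It copies only a handful of rows from the table, and it assumes the Parameters column is expressed in units of 100 million (for example 700.0 for a 70B model), which is inferred from the values rather than stated on the page. The helper names are hypothetical.

```python
# Each row: (model, parameters in units of 100M, HumanEval Pass@1, MBPP Pass@1).
# None stands for a "/" entry, i.e. no score listed. Subset of the table only.
ROWS = [
    ("Llama3.3-70B-Instruct", 700.0, 88.40, 87.60),
    ("Gemini-pro", 1000.0, 67.70, None),
    ("Qwen2-57B-A14B", 570.0, 53.0, 71.90),
    ("Mixtral-8×7B-MoE", 450.0, 40.20, 60.70),
    ("XVERSE-65B", 650.0, 26.80, None),
]


def params_in_billions(params_100m: float) -> float:
    """Convert the assumed 100M unit to billions (700.0 -> 70.0)."""
    return params_100m / 10.0


def filter_by_size(rows, min_b: float = 0.0, max_b: float = float("inf")):
    """Keep rows whose size lies in [min_b, max_b] billion parameters,
    sorted by HumanEval Pass@1 in descending order."""
    selected = [r for r in rows if min_b <= params_in_billions(r[1]) <= max_b]
    return sorted(selected, key=lambda r: r[2], reverse=True)


if __name__ == "__main__":
    for model, params, humaneval, mbpp in filter_by_size(ROWS, min_b=65, max_b=70):
        print(model, params_in_billions(params), humaneval, mbpp)
```

Keeping missing scores as None instead of 0 avoids ranking a model as if it had scored zero on a benchmark it was never reported on.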