DataLearner logoDataLearnerAI
AI Tech Blogs
Leaderboards
Benchmarks
Models
Resources
Tool Directory

加载中...

DataLearner logoDataLearner AI

A knowledge platform focused on LLM benchmarking, datasets, and practical instruction with continuously updated capability maps.

产品

  • Leaderboards
  • 模型对比
  • Datasets

资源

  • Tutorials
  • Editorial
  • Tool directory

关于

  • 关于我们
  • 隐私政策
  • 数据收集方法
  • 联系我们

© 2026 DataLearner AI. DataLearner curates industry data and case studies so researchers, enterprises, and developers can rely on trustworthy intelligence.

隐私政策服务条款
← Back to Main Leaderboard

大模型编程能力评测排行榜

本页面提供当前主流大模型在代码能力上的评测结果,包括HumanEval和MBPP等基准数据集。

Data source: 论文或GitHub评测结果

Filters

Filter by size:All3B and below7B13B34B65B100B and above
ModelParametersHumanEval Pass@1MBPP Pass@1OrganizationLicense
Phi 4 - 14B140.082.60/Microsoft Azure/
WizardCoder-Python-13B-V1.0130.06454.60WizardLM Team/
PanGu-Coder2150.061.64/华为

Data is for reference only. Official sources are authoritative. Click model names to view DataLearner model profiles.

/
WizardCoder-15B-V1.0150.057.30/WizardLM Team/
Qwen2.5-14B140.056.7076.70阿里巴巴/
Moonlight-16B-A3B-Instruct160.048.1063.80Moonshot AI/
CodeLLaMA-Python-13B130.043.3049Facebook AI研究实验室/
CodeLLaMA-Instruct-13B130.042.7049.40Facebook AI研究实验室/
WizardLM-30B-V1300.037.80/WizardLM Team/
CodeLLaMA-13B130.03647Facebook AI研究实验室/
StarCoder155.033.6052.70BigCode/
Qwen-14B140.032.3040.80阿里巴巴/
StarCodeBase155.030.4049BigCode/
CodeGeeX130.022.90/智谱AI/
LLaMA2 13B130.020.1027.60Facebook AI研究实验室/
Baichuan2-13B-Base130.017.0730.20百川智能/
Baichuan 13B - Base130.011.5922.90百川智能/