DataLearner AI

A knowledge platform focused on LLM benchmarking, datasets, and practical instruction with continuously updated capability maps.

© 2026 DataLearner AI. DataLearner curates industry data and case studies so researchers, enterprises, and developers can rely on trustworthy intelligence.


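LLM Coding Ability Leaderboard

This page presents evaluation results for current mainstream large language models on coding ability, covering benchmark datasets such as HumanEval and MBPP.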

Top Model: Phi 4 - 14B

Top Score: -

Model Count: 17

Data version: -

Data source: evaluation results reported in papers or on GitHub


Ranking Table

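| Model | Parameters (×100M) | HumanEval Pass@1 | MBPP Pass@1 | Organization | License |
|---|---|---|---|---|---|
| Phi 4 - 14B | 140.0 | 82.60 | / | Microsoft Azure | / |
| WizardCoder-Python-13B-V1.0 | 130.0 | 64 | 54.60 | WizardLM Team | / |
| PanGu-Coder2 | 150.0 | 61.64 | / | Huawei | / |
| WizardCoder-15B-V1.0 | 150.0 | 57.30 | / | WizardLM Team | / |
| Qwen2.5-14B | 140.0 | 56.70 | 76.70 | Alibaba | / |
| Moonlight-16B-A3B-Instruct | 160.0 | 48.10 | 63.80 | Moonshot AI | / |
| CodeLLaMA-Python-13B | 130.0 | 43.30 | 49 | Facebook AI Research | / |
| CodeLLaMA-Instruct-13B | 130.0 | 42.70 | 49.40 | Facebook AI Research | / |
| WizardLM-30B-V1 | 300.0 | 37.80 | / | WizardLM Team | / |
| CodeLLaMA-13B | 130.0 | 36 | 47 | Facebook AI Research | / |
| StarCoder | 155.0 | 33.60 | 52.70 | BigCode | / |
| Qwen-14B | 140.0 | 32.30 | 40.80 | Alibaba | / |
| StarCoderBase | 155.0 | 30.40 | 49 | BigCode | / |
| CodeGeeX | 130.0 | 22.90 | / | Zhipu AI | / |
| LLaMA2 13B | 130.0 | 20.10 | 27.60 | Facebook AI Research | / |
| Baichuan2-13B-Base | 130.0 | 17.07 | 30.20 | Baichuan Intelligent Technology | / |
| Baichuan 13B - Base | 130.0 | 11.59 | 22.90 | Baichuan Intelligent Technology | / |

Data is for reference only. Official sources are authoritative. Click model names to view DataLearner model profiles.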
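
For readers unfamiliar with the Pass@1 columns, the sketch below shows one common way such a score is computed: the unbiased pass@k estimator introduced alongside HumanEval, averaged over all problems and expressed as a percentage. The leaderboard does not state how each listed score was produced (for example, greedy decoding versus sampling), so this is an illustration under that assumption rather than the method behind any particular row. The function names `pass_at_k` and `benchmark_pass_at_k` are hypothetical.

```python
from math import comb


def pass_at_k(n: int, c: int, k: int) -> float:
    """Estimate pass@k for one problem.

    n: number of generated samples, c: samples that pass all unit tests,
    k: sample budget. Uses 1 - C(n - c, k) / C(n, k).
    """
    if not 0 <= c <= n:
        raise ValueError("c must satisfy 0 <= c <= n")
    if not 1 <= k <= n:
        raise ValueError("k must satisfy 1 <= k <= n")
    if n - c < k:
        # Fewer than k failing samples: any draw of k contains a pass.
        return 1.0
    return 1.0 - comb(n - c, k) / comb(n, k)


def benchmark_pass_at_k(results: list[tuple[int, int]], k: int) -> float:
    """Average pass@k over problems, as a percentage.

    results: one (n, c) pair per benchmark problem (hypothetical input shape).
    """
    if not results:
        raise ValueError("results must not be empty")
    return 100.0 * sum(pass_at_k(n, c, k) for n, c in results) / len(results)
```

With a single greedy sample per problem (n = 1, k = 1), the estimator reduces to the fraction of problems solved, so `benchmark_pass_at_k([(1, 1), (1, 0)], k=1)` gives 50.0.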