DataLearner logoDataLearnerAI
Latest AI Insights
Model Evaluations
Model Directory
Model Comparison
Resource Center
Tools
LanguageEnglish

加载中...

DataLearner logoDataLearner AI

A knowledge platform focused on LLM benchmarking, datasets, and practical instruction with continuously updated capability maps.

Products

  • Leaderboards
  • Model comparison
  • Datasets

Resources

  • Tutorials
  • Editorial
  • Tool directory

Company

  • About
  • Privacy policy
  • Data methodology
  • Contact

© 2026 DataLearner AI. DataLearner curates industry data and case studies so researchers, enterprises, and developers can rely on trustworthy intelligence.

Privacy policyTerms of service
HomeEvaluation Overview大模型编程能力评测排行榜

大模型编程能力评测排行榜

本页面提供当前主流大模型在代码能力上的评测结果,包括HumanEval和MBPP等基准数据集。

Top Model

Phi-3-mini 3.8B

Top Score

-

Model Count

17

Data version

-

Data source: 论文或GitHub评测结果

Filters

Filter by size:All3B and below7B13B34B65B100B and above

Ranking Table

ModelParametersHumanEval Pass@1MBPP Pass@1OrganizationLicense
Phi-3-mini 3.8B38.058.5070Microsoft Azure/
Phi-113.050.6055.50Microsoft Azure/
MiniCPM-2B-DPO

Data is for reference only. Official sources are authoritative. Click model names to view DataLearner model profiles.

24.0
50
47.31
面壁智能
/
Phi-227.048.3059.10Microsoft Azure/
Qwen2.5-3B30.042.1057.10阿里巴巴/
Qwen2.5-1.5B15.037.2060.20阿里巴巴/
Stable LM Zephyr 3B30.035.3731.85Stability AI/
Phi-1.513.034.1037.70Microsoft Azure/
Qwen2-1.5B15.031.1037.40阿里巴巴/
Qwen2.5-0.5B5.030.5039.30阿里巴巴/
Gemma 2B20.02229.20Google Research/
Gemma 2B - It20.02229.20Google Research/
CodeGemma-2B20.02229.20Google Research/
Qwen2-0.5B4.02222阿里巴巴/
RecurrentGemma-2B27.021.3028.80Google Research/
Qwen-1.8B18.015.20/阿里巴巴/
TinyLlama11.06.7119.91新加坡科技与设计大学/