DataLearner logoDataLearnerAI
AI Tech Blogs
Leaderboards
Benchmarks
Models
Resources
Tool Directory

加载中...

DataLearner logoDataLearner AI

A knowledge platform focused on LLM benchmarking, datasets, and practical instruction with continuously updated capability maps.

产品

  • Leaderboards
  • 模型对比
  • Datasets

资源

  • Tutorials
  • Editorial
  • Tool directory

关于

  • 关于我们
  • 隐私政策
  • 数据收集方法
  • 联系我们

© 2026 DataLearner AI. DataLearner curates industry data and case studies so researchers, enterprises, and developers can rely on trustworthy intelligence.

隐私政策服务条款
← Back to Main Leaderboard

大模型编程能力评测排行榜

本页面提供当前主流大模型在代码能力上的评测结果,包括HumanEval和MBPP等基准数据集。

Data source: 论文或GitHub评测结果

Filters

Filter by size:All3B and below7B13B34B65B100B and above
ModelParametersHumanEval Pass@1MBPP Pass@1OrganizationLicense
Phi-3-mini 3.8B38.058.5070Microsoft Azure/
Phi-113.050.6055.50Microsoft Azure/
MiniCPM-2B-DPO24.05047.31面壁智能

Data is for reference only. Official sources are authoritative. Click model names to view DataLearner model profiles.

/
Phi-227.048.3059.10Microsoft Azure/
Qwen2.5-3B30.042.1057.10阿里巴巴/
Qwen2.5-1.5B15.037.2060.20阿里巴巴/
Stable LM Zephyr 3B30.035.3731.85Stability AI/
Phi-1.513.034.1037.70Microsoft Azure/
Qwen2-1.5B15.031.1037.40阿里巴巴/
Qwen2.5-0.5B5.030.5039.30阿里巴巴/
Gemma 2B20.02229.20Google Research/
Gemma 2B - It20.02229.20Google Research/
CodeGemma-2B20.02229.20Google Research/
Qwen2-0.5B4.02222阿里巴巴/
RecurrentGemma-2B27.021.3028.80Google Research/
Qwen-1.8B18.015.20/阿里巴巴/
TinyLlama11.06.7119.91新加坡科技与设计大学/