DataLearner logoDataLearnerAI
AI Tech Blogs
Leaderboards
Benchmarks
Models
Resources
Tool Directory

加载中...

DataLearner logoDataLearner AI

A knowledge platform focused on LLM benchmarking, datasets, and practical instruction with continuously updated capability maps.

产品

  • Leaderboards
  • 模型对比
  • Datasets

资源

  • Tutorials
  • Editorial
  • Tool directory

关于

  • 关于我们
  • 隐私政策
  • 数据收集方法
  • 联系我们

© 2026 DataLearner AI. DataLearner curates industry data and case studies so researchers, enterprises, and developers can rely on trustworthy intelligence.

隐私政策服务条款
← Back to Main Leaderboard

大模型编程能力评测排行榜

本页面提供当前主流大模型在代码能力上的评测结果,包括HumanEval和MBPP等基准数据集。

Data source: 论文或GitHub评测结果

Filters

Filter by size:All3B and below7B13B34B65B100B and above
ModelParametersHumanEval Pass@1MBPP Pass@1OrganizationLicense
Qwen2.5-Coder-32B-Instruct320.092.7090.20阿里巴巴/
Mistral Small 24B Instruct 2501240.084.80/MistralAI/
DeepSeek Coder-33B Instruct330.079.3070DeepSeek-AI

Data is for reference only. Official sources are authoritative. Click model names to view DataLearner model profiles.

/
WizardCoder-Python-34B340.073.20/WizardLM Team/
Phind-CodeLlama-34B-Python-v1340.069.50/Phind/
Phind-CodeLlama-34B-v1340.067.60/Phind/
Codestral220.061.5078.20MistralAI/
Qwen2.5-32B320.058.5084.50阿里巴巴/
CodeLLaMA-Python-34B340.053.7056.20Facebook AI研究实验室/
YAYI2-30B300.053.1045.80中科闻歌/
CodeLLaMA-34B340.048.8055Facebook AI研究实验室/
Yi-1.5-34B340.046.3065.50零一万物/
CodeLLaMA-Instruct-34B340.041.5057Facebook AI研究实验室/
Grok-0330.039.70/xAI/
Qwen1.5-32B320.037.2049.40阿里巴巴/
Aquila2-34B340.035.40/北京智源人工智能研究院/
XVERSE-MoE-A4.2B258.029.90/元象XVERSE/
LLaMA2 34B340.022.6033.80Facebook AI研究实验室/
Mistral Small 24B Base2501240.0/69.64MistralAI/