DataLearner logoDataLearnerAI
Latest AI Insights
Model Evaluations
Model Directory
Model Comparison
Resource Center
Tools
LanguageEnglish

加载中...

DataLearner logoDataLearner AI

A knowledge platform focused on LLM benchmarking, datasets, and practical instruction with continuously updated capability maps.

Products

  • Leaderboards
  • Model comparison
  • Datasets

Resources

  • Tutorials
  • Editorial
  • Tool directory

Company

  • About
  • Privacy policy
  • Data methodology
  • Contact

© 2026 DataLearner AI. DataLearner curates industry data and case studies so researchers, enterprises, and developers can rely on trustworthy intelligence.

Privacy policyTerms of service
HomeEvaluation Overview大模型编程能力评测排行榜

大模型编程能力评测排行榜

本页面提供当前主流大模型在代码能力上的评测结果,包括HumanEval和MBPP等基准数据集。

Top Model

Qwen2.5-Coder-32B-Instruct

Top Score

-

Model Count

19

Data version

-

Data source: 论文或GitHub评测结果

Filters

Filter by size:All3B and below7B13B34B65B100B and above

Ranking Table

ModelParametersHumanEval Pass@1MBPP Pass@1OrganizationLicense
Qwen2.5-Coder-32B-Instruct320.092.7090.20阿里巴巴/
Mistral Small 24B Instruct 2501240.084.80/MistralAI/
DeepSeek Coder-33B Instruct330.079.3070DeepSeek-AI/
WizardCoder-Python-34B340.073.20/WizardLM Team/
Phind-CodeLlama-34B-Python-v1340.069.50/Phind/
Phind-CodeLlama-34B-v1340.067.60/Phind/
Codestral220.061.5078.20MistralAI/
Qwen2.5-32B320.058.5084.50阿里巴巴/
CodeLLaMA-Python-34B340.053.7056.20Facebook AI研究实验室/
YAYI2-30B300.053.1045.80中科闻歌/
CodeLLaMA-34B340.048.8055Facebook AI研究实验室/
Yi-1.5-34B340.046.3065.50零一万物/
CodeLLaMA-Instruct-34B340.041.5057Facebook AI研究实验室/
Grok-0330.039.70/xAI/
Qwen1.5-32B320.037.2049.40阿里巴巴/
Aquila2-34B340.035.40/北京智源人工智能研究院/
XVERSE-MoE-A4.2B258.029.90/元象XVERSE/
LLaMA2 34B340.022.6033.80Facebook AI研究实验室/
Mistral Small 24B Base2501240.0/69.64MistralAI/

Data is for reference only. Official sources are authoritative. Click model names to view DataLearner model profiles.