DataLearner logoDataLearnerAI
AI Tech Blogs
Leaderboards
Benchmarks
Models
Resources
Tool Directory

加载中...

DataLearner logoDataLearner AI

A knowledge platform focused on LLM benchmarking, datasets, and practical instruction with continuously updated capability maps.

产品

  • Leaderboards
  • 模型对比
  • Datasets

资源

  • Tutorials
  • Editorial
  • Tool directory

关于

  • 关于我们
  • 隐私政策
  • 数据收集方法
  • 联系我们

© 2026 DataLearner AI. DataLearner curates industry data and case studies so researchers, enterprises, and developers can rely on trustworthy intelligence.

隐私政策服务条款
← Back to Main Leaderboard

大模型编程能力评测排行榜

本页面提供当前主流大模型在代码能力上的评测结果,包括HumanEval和MBPP等基准数据集。

Data source: 论文或GitHub评测结果

Filters

Filter by size:All3B and below7B13B34B65B100B and above
ModelParametersHumanEval Pass@1MBPP Pass@1OrganizationLicense
Claude 3.5 Sonnet New0.093.70/Anthropic/
Qwen2.5-Coder-32B-Instruct320.092.7090.20阿里巴巴/
OpenAI o1-mini/92.40/OpenAI/
Claude 3.5 Sonnet/92/Anthropic/
GPT-4o0.090.20/OpenAI/
Llama3.1-405B Instruct4050.08988.60Facebook AI研究实验室/
DeepSeek V2.52360.089/DeepSeek-AI/
Amazon Nova Pro/89/亚马逊/
Llama3.3-70B-Instruct700.088.4087.60Facebook AI研究实验室/
Grok 22690.088.40/xAI/
Claude 3.5 Haiku0.088.10/Anthropic/
GPT-4o mini0.087.20/OpenAI/
Codestral 25.01/86.6080.20MistralAI/
Qwen2-72B-Instruct720.08680.20阿里巴巴/
GPT-41750.085.4083.50OpenAI/
Amazon Nova Lite/85.40/亚马逊/
Claude3-Opus0.084.90/Anthropic/
Mistral Small 24B Instruct 2501240.084.80/MistralAI/
Qwen2.5-Omni-7B70.084.8079.20阿里巴巴/
Llama3-400B-Instruct-InTraining4000.084.10/Facebook AI研究实验室/
CodeQwen1.5-7B-Chat70.083.5077.70阿里巴巴/
Phi 4 - 14B140.082.60/Microsoft Azure/
DeepSeek-V36810.082.60/DeepSeek-AI/
Llama3-70B700.081.70/Facebook AI研究实验室/
Llama3-70B-Instruct700.081.70/Facebook AI研究实验室/
Amazon Nova Micro/81.10/亚马逊/
Llama3.1-70B-Instruct700.080.5086Facebook AI研究实验室/
C4AI Command A (202503)1110.080/CohereAI/
DeepSeek Coder-33B Instruct330.079.3070DeepSeek-AI/
Claude3-Haiku0.075.90/Anthropic/
Gemini-ultra0.074.40/DeepMind/
Grok-1.5/74.10/xAI/
DeepSeek-V2-236B-Chat2360.073.8061.40DeepSeek-AI/
WizardCoder-Python-34B340.073.20/WizardLM Team/
Qwen2.5-Max/73.2080.60阿里巴巴/
Claude3-Sonnet0.073/Anthropic/
Llama3.1-8B-Instruct80.072.6072.80Facebook AI研究实验室/
GLM40.072/智谱AI/
Gemini 1.5 Pro0.071.90/Google Deep Mind/
GLM-4-9B-Chat90.071.80/智谱AI/
DBRX Instruct1320.070.10/databricks/
GLM-4-9B90.070.10/智谱AI/
Phind-CodeLlama-34B-Python-v1340.069.50/Phind/
Gemini-pro1000.067.70/DeepMind/
Phind-CodeLlama-34B-v1340.067.60/Phind/
DeepSeek Coder-6.7B Instruct67.066.1065.40DeepSeek-AI/
DeepSeek-V3-Base6810.065.2075.40DeepSeek-AI/
Qwen2-72B727.064.6076.90阿里巴巴/
WizardCoder-Python-13B-V1.0130.06454.60WizardLM Team/
Grok-13140.063.20/xAI/
Llama3-8B80.062.20/Facebook AI研究实验室/
Llama3-8B-Instruct80.062.20/Facebook AI研究实验室/
PanGu-Coder2150.061.64/华为/
Codestral220.061.5078.20MistralAI/
Phi-3-small 7B70.059.1071.40Microsoft Azure/
Qwen2.5-72B727.059.1084.70阿里巴巴/
Phi-3-mini 3.8B38.058.5070Microsoft Azure/
Qwen2.5-32B320.058.5084.50阿里巴巴/
Qwen2.5-7B70.057.9074.90阿里巴巴/
WizardCoder-15B-V1.0150.057.30/WizardLM Team/
Qwen2.5-14B140.056.7076.70阿里巴巴/
CodeGemma-7B-IT70.056.1054.20Google Research/
Phi-3-medium 14B-preview140.055.5074.40Microsoft Azure/
MiniCPM-MoE-8x2B136.055.4941.68OpenBMB/
CodeLLaMA-Python-34B340.053.7056.20Facebook AI研究实验室/
YAYI2-30B300.053.1045.80中科闻歌/
Qwen2-57B-A14B570.05371.90阿里巴巴/
Qwen1.5-110B1100.052.4058.10阿里巴巴/
CodeQwen1.5-7B70.051.8072.20阿里巴巴/
Qwen2-7B70.051.2065.90阿里巴巴/
Phi-113.050.6055.50Microsoft Azure/
MiniCPM-2B-DPO24.05047.31面壁智能/
CodeLLaMA-34B340.048.8055Facebook AI研究实验室/
Phi-227.048.3059.10Microsoft Azure/
GPT-3.51750.048.1052.20OpenAI/
Moonlight-16B-A3B-Instruct160.048.1063.80Moonshot AI/
Yi-1.5-34B340.046.3065.50零一万物/
Mixtral-8×22B-MoE1410.045.1071.20MistralAI/
CodeGemma-7B70.044.5056.20Google Research/
CodeLLaMA-Python-13B130.043.3049Facebook AI研究实验室/
CodeLLaMA-Instruct-13B130.042.7049.40Facebook AI研究实验室/
Qwen2.5-3B30.042.1057.10阿里巴巴/
CodeLLaMA-Instruct-34B340.041.5057Facebook AI研究实验室/
Qwen1.5-72B-Chat720.041.5053.40阿里巴巴/
Yi-1.5-9B90.041.4061.10零一万物/
DeepSeek-V2-236B2360.040.9066.60DeepSeek-AI/
Mixtral-8×7B-MoE450.040.2060.70MistralAI/
Gemma 2 - 9B90.040.2052.40Google Research/
Grok-0330.039.70/xAI/
Yi-9B90.03954.40零一万物/
CodeLLaMA-Python-7B70.038.4047.60Facebook AI研究实验室/
WizardLM-30B-V1300.037.80/WizardLM Team/
PaLM2-S0.037.6050Google Research/
Qwen1.5-32B320.037.2049.40阿里巴巴/
Qwen2.5-1.5B15.037.2060.20阿里巴巴/
CodeLLaMA-13B130.03647Facebook AI研究实验室/
CodeGeeX2-6B60.035.90/智谱AI/
PaLM-Coder5400.035.9047Google Research/
Aquila2-34B340.035.40/北京智源人工智能研究院/
Qwen-72B720.035.4052.20阿里巴巴/
Stable LM Zephyr 3B30.035.3731.85Stability AI/
CodeLLaMA-Instruct-7B70.034.8044.40Facebook AI研究实验室/
WizardCoder-3B-V1.030.034.8037.40WizardLM Team/
Qwen1.5-MoE-A2.7B143.034.20/阿里巴巴/
Phi-1.513.034.1037.70Microsoft Azure/
StarCoder155.033.6052.70BigCode/
CodeLLaMA-7B70.033.5041.40Facebook AI研究实验室/
Qwen-14B140.032.3040.80阿里巴巴/
Gemma 7B70.032.3044.40Google Research/
Qwen2-1.5B15.031.1037.40阿里巴巴/
LLaMA2 70B700.030.5045.40Facebook AI研究实验室/
Mistral 7B73.030.5047.50MistralAI/
Qwen2.5-0.5B5.030.5039.30阿里巴巴/
StarCodeBase155.030.4049BigCode/
Qwen-7B70.029.9031.60阿里巴巴/
XVERSE-MoE-A4.2B258.029.90/元象XVERSE/
Codex1750.028.81/OpenAI/
AquilaCode-7B-py70.028.80/北京智源人工智能研究院/
XVERSE-65B650.026.80/元象XVERSE/
PaLM5400.026.2047Google Research/
WizardCoder-1B-V1.010.023.8028.60WizardLM Team/
CodeGeeX130.022.90/智谱AI/
LLaMA2 34B340.022.6033.80Facebook AI研究实验室/
AquilaCode-7B-multi70.022/北京智源人工智能研究院/
Gemma 2B20.02229.20Google Research/
Gemma 2B - It20.02229.20Google Research/
CodeGemma-2B20.02229.20Google Research/
Qwen2-0.5B4.02222阿里巴巴/
RecurrentGemma-2B27.021.3028.80Google Research/
LLaMA2 13B130.020.1027.60Facebook AI研究实验室/
Baichuan2-7B-Base70.018.2924.20百川智能/
Baichuan2-13B-Base130.017.0730.20百川智能/
Qwen-1.8B18.015.20/阿里巴巴/
LLaMA2 7B70.012.2020.80Facebook AI研究实验室/
Baichuan 13B - Base130.011.5922.90百川智能/
Baichuan 7B70.09.206.60百川智能/
TinyLlama11.06.7119.91新加坡科技与设计大学/
Mistral Large0.04.107.10MistralAI/
Mistral Small 24B Base2501240.0/69.64MistralAI/

Data is for reference only. Official sources are authoritative. Click model names to view DataLearner model profiles.