LLM Coding Leaderboard
This page provides current LLM coding evaluation results, including HumanEval and MBPP Pass@1 scores.
Top Model
Qwen2.5-Omni-7B
Top Score
-
Model Count
30
Data version
-
Data source: 论文或GitHub评测结果
Ranking Table
| Model | Parameters | HumanEval Pass@1 | MBPP Pass@1 | Organization | License |
|---|---|---|---|---|---|
Qwen2.5-Omni-7B阿里巴巴 | 70 | 84.80 | 79.20 | 阿里巴巴 | — |
CodeQwen1.5-7B-Chat阿里巴巴 | 70 | 83.50 | 77.70 | 阿里巴巴 | — |
Llama3.1-8B-InstructFacebook AI研究实验室 | 80 | 72.60 | 72.80 | Facebook AI研究实验室 | — |
GLM-4-9B-Chat智谱AI | 90 | 71.80 | — | 智谱AI | — |
GLM-4-9B智谱AI | 90 | 70.10 | — | 智谱AI | — |
DeepSeek Coder-6.7B InstructDeepSeek-AI | 67 | 66.10 | 65.40 | DeepSeek-AI | — |
Llama3-8BFacebook AI研究实验室 | 80 | 62.20 | — | Facebook AI研究实验室 | — |
Llama3-8B-InstructFacebook AI研究实验室 | 80 | 62.20 | — | Facebook AI研究实验室 | — |
Phi-3-small 7BMicrosoft Azure | 70 | 59.10 | 71.40 | Microsoft Azure | — |
Qwen2.5-7B阿里巴巴 | 70 | 57.90 | 74.90 | 阿里巴巴 | — |
CodeGemma-7B-ITGoogle Research | 70 | 56.10 | 54.20 | Google Research | — |
CodeQwen1.5-7B阿里巴巴 | 70 | 51.80 | 72.20 | 阿里巴巴 | — |
Qwen2-7B阿里巴巴 | 70 | 51.20 | 65.90 | 阿里巴巴 | — |
CodeGemma-7BGoogle Research | 70 | 44.50 | 56.20 | Google Research | — |
Gemma 2 - 9BGoogle Research | 90 | 40.20 | 52.40 | Google Research | — |
CodeLLaMA-Python-7BFacebook AI研究实验室 | 70 | 38.40 | 47.60 | Facebook AI研究实验室 | — |
PaLM2-SGoogle Research | 0 | 37.60 | 50.00 | Google Research | — |
CodeGeeX2-6B智谱AI | 60 | 35.90 | — | 智谱AI | — |
CodeLLaMA-Instruct-7BFacebook AI研究实验室 | 70 | 34.80 | 44.40 | Facebook AI研究实验室 | — |
WizardCoder-3B-V1.0WizardLM Team | 30 | 34.80 | 37.40 | WizardLM Team | — |
CodeLLaMA-7BFacebook AI研究实验室 | 70 | 33.50 | 41.40 | Facebook AI研究实验室 | — |
Gemma 7BGoogle Research | 70 | 32.30 | 44.40 | Google Research | — |
Mistral 7BMistralAI | 73 | 30.50 | 47.50 | MistralAI | — |
Qwen-7B阿里巴巴 | 70 | 29.90 | 31.60 | 阿里巴巴 | — |
AquilaCode-7B-py北京智源人工智能研究院 | 70 | 28.80 | — | 北京智源人工智能研究院 | — |
WizardCoder-1B-V1.0WizardLM Team | 10 | 23.80 | 28.60 | WizardLM Team | — |
AquilaCode-7B-multi北京智源人工智能研究院 | 70 | 22.00 | — | 北京智源人工智能研究院 | — |
Baichuan2-7B-Base百川智能 | 70 | 18.29 | 24.20 | 百川智能 | — |
LLaMA2 7BFacebook AI研究实验室 | 70 | 12.20 | 20.80 | Facebook AI研究实验室 | — |
Baichuan 7B百川智能 | 70 | 9.20 | 6.60 | 百川智能 | — |
Data is for reference only. Official sources are authoritative. Click model names to view DataLearner model profiles.









