DataLearner logoDataLearnerAI
Latest AI Insights
Model Leaderboards
Benchmarks
Model Directory
Model Comparison
Resource Center
Tools
LanguageEnglish
DataLearner logoDataLearner AI

A knowledge platform focused on LLM benchmarking, datasets, and practical instruction with continuously updated capability maps.

Products

  • Leaderboards
  • Model comparison
  • Datasets

Resources

  • Tutorials
  • Editorial
  • Tool directory

Company

  • About
  • Privacy policy
  • Data methodology
  • Contact

© 2026 DataLearner AI. DataLearner curates industry data and case studies so researchers, enterprises, and developers can rely on trustworthy intelligence.

Privacy policyTerms of service
HomeOverall LeaderboardLLM Coding Leaderboard

LLM Coding Leaderboard

This page provides current LLM coding evaluation results, including HumanEval and MBPP Pass@1 scores.

Top Model

Llama3.3-70B-Instruct

Top Score

-

Model Count

14

Data version

-

Data source: 论文或GitHub评测结果

Filter by size:All3B and below7B13B34B65B100B and above
Origin:AllChina
Leaderboard snapshot month:

Ranking Table

ModelParametersHumanEval Pass@1MBPP Pass@1OrganizationLicense
Facebook AI研究实验室Llama3.3-70B-InstructFacebook AI研究实验室70088.4087.60Facebook AI研究实验室—
阿里巴巴Qwen2-72B-Instruct阿里巴巴72086.0080.20阿里巴巴—
Facebook AI研究实验室Llama3-70BFacebook AI研究实验室70081.70—Facebook AI研究实验室—
Facebook AI研究实验室Llama3-70B-InstructFacebook AI研究实验室70081.70—Facebook AI研究实验室—
Facebook AI研究实验室Llama3.1-70B-InstructFacebook AI研究实验室70080.5086.00Facebook AI研究实验室—
DeepMindGemini-proDeepMind1,00067.70—DeepMind—
阿里巴巴Qwen2-72B阿里巴巴72764.6076.90阿里巴巴—
阿里巴巴Qwen2.5-72B阿里巴巴72759.1084.70阿里巴巴—
阿里巴巴Qwen2-57B-A14B阿里巴巴57053.0071.90阿里巴巴—
阿里巴巴Qwen1.5-72B-Chat阿里巴巴72041.5053.40阿里巴巴—
MistralAIMixtral-8×7B-MoEMistralAI45040.2060.70MistralAI—
阿里巴巴Qwen-72B阿里巴巴72035.4052.20阿里巴巴—
Facebook AI研究实验室LLaMA2 70BFacebook AI研究实验室70030.5045.40Facebook AI研究实验室—
元象XVERSEXVERSE-65B元象XVERSE65026.80—元象XVERSE—

Data is for reference only. Official sources are authoritative. Click model names to view DataLearner model profiles.