DataLearner logoDataLearnerAI
Latest AI Insights
Model Evaluations
Model Directory
Model Comparison
Resource Center
Tools
LanguageEnglish

加载中...

DataLearner logoDataLearner AI

A knowledge platform focused on LLM benchmarking, datasets, and practical instruction with continuously updated capability maps.

Products

  • Leaderboards
  • Model comparison
  • Datasets

Resources

  • Tutorials
  • Editorial
  • Tool directory

Company

  • About
  • Privacy policy
  • Data methodology
  • Contact

© 2026 DataLearner AI. DataLearner curates industry data and case studies so researchers, enterprises, and developers can rely on trustworthy intelligence.

Privacy policyTerms of service
Back to Main Leaderboard

大模型代码编程能力评测排行榜

本页面提供大模型代码编程能力评测排行榜,涵盖 SWE-Bench、LiveCodeBench、HumanEval 等数据集,对 GPT、Claude、Qwen、DeepSeek 等模型进行对比。

Updated on: 2025/10/12 20:54:51
SWE-bench VerifiedLiveCodeBenchHumanEval
More Benchmarks
Model Size:All3B and below7B13B34B65B100B and above
Model Type:AllReasoning ModelsFoundation ModelsInstruction/Chat ModelsCoding Models

LLM Performance Results

Data source: DataLearnerAI
RankModelSWE-bench VerifiedLiveCodeBenchHumanEvalParams (B)License
1Qwen3.5-397B-A17B76.4083.600.00397BFree commercial
2Qwen3.5-27B72.4080.700.00270BFree commercial
3GLM-4.7-Flash59.200.000.00310BFree commercial
4Devstral Small 1.153.600.000.00240BFree commercial
5Qwen3-Coder-Flash51.600.000.00305BFree commercial
6Devstral Small 1.046.800.000.00240BFree commercial
7GPT OSS 20B34.000.000.00210BFree commercial
8Qwen3-30B-A3B-250722.0043.200.00305BFree commercial
9Qwen3-30B-A3B0.0029.000.00305BFree commercial
10Mistral-Small-3.1-24B-Instruct-25030.000.0088.41240BFree commercial
11Magistral-Small-25060.0055.840.00240BFree commercial
12Qwen3-32B0.0065.700.00320BFree commercial
13Qwen3-235B-A22B-Thinking0.0074.100.00305BFree commercial
14Gemma 4 31B0.0080.000.00310BFree commercial
15QwQ-32B0.000.0019.00325BFree commercial
16Gemma2-27B0.000.0051.80270BFree commercial
17C4AI Aya Vision 32B0.000.0062.20320BNon-commercial
18Codestral0.0031.5081.10220BNon-commercial
19Gemma 3 - 27B (IT)0.0029.7087.80270BFree commercial
20Qwen2.5-32B0.0051.2088.40320BFree commercial
1
Qwen3.5-397B-A17B
397B
SWE-bench Verified76.40
LiveCodeBench83.60
HumanEval0.00
Free commercial
2
Qwen3.5-27B
270B
SWE-bench Verified72.40
LiveCodeBench80.70
HumanEval0.00
Free commercial
3
GLM-4.7-Flash
310B
SWE-bench Verified59.20
LiveCodeBench0.00
HumanEval0.00
Free commercial
4
Devstral Small 1.1
240B
SWE-bench Verified53.60
LiveCodeBench0.00
HumanEval0.00
Free commercial
5
Qwen3-Coder-Flash
305B
SWE-bench Verified51.60
LiveCodeBench0.00
HumanEval0.00
Free commercial
6
Devstral Small 1.0
240B
SWE-bench Verified46.80
LiveCodeBench0.00
HumanEval0.00
Free commercial
7
GPT OSS 20B
210B
SWE-bench Verified34.00
LiveCodeBench0.00
HumanEval0.00
Free commercial
8
Qwen3-30B-A3B-2507
305B
SWE-bench Verified22.00
LiveCodeBench43.20
HumanEval0.00
Free commercial
9
Qwen3-30B-A3B
305B
SWE-bench Verified0.00
LiveCodeBench29.00
HumanEval0.00
Free commercial
10
Mistral-Small-3.1-24B-Instruct-2503
240B
SWE-bench Verified0.00
LiveCodeBench0.00
HumanEval88.41
Free commercial
11
Magistral-Small-2506
240B
SWE-bench Verified0.00
LiveCodeBench55.84
HumanEval0.00
Free commercial
12
Qwen3-32B
320B
SWE-bench Verified0.00
LiveCodeBench65.70
HumanEval0.00
Free commercial
13
Qwen3-235B-A22B-Thinking
305B
SWE-bench Verified0.00
LiveCodeBench74.10
HumanEval0.00
Free commercial
14
Gemma 4 31B
310B
SWE-bench Verified0.00
LiveCodeBench80.00
HumanEval0.00
Free commercial
15
QwQ-32B
325B
SWE-bench Verified0.00
LiveCodeBench0.00
HumanEval19.00
Free commercial
16
Gemma2-27B
270B
SWE-bench Verified0.00
LiveCodeBench0.00
HumanEval51.80
Free commercial
17
C4AI Aya Vision 32B
320B
SWE-bench Verified0.00
LiveCodeBench0.00
HumanEval62.20
Non-commercial
18
Codestral
220B
SWE-bench Verified0.00
LiveCodeBench31.50
HumanEval81.10
Non-commercial
19
Gemma 3 - 27B (IT)
270B
SWE-bench Verified0.00
LiveCodeBench29.70
HumanEval87.80
Free commercial
20
Qwen2.5-32B
320B
SWE-bench Verified0.00
LiveCodeBench51.20
HumanEval88.40
Free commercial