SWE-bench Multilingual
Updated Apr 24, 2026·837 views
- Problem Count
- 300
- Institution
- —
- Category
- Coding and Software Engineering
- Metrics
- Accuracy
- Language
- Multilingual
- Difficulty
- Medium
Overview
SWE-bench Multilingual is an AI benchmark used to evaluate model capabilities. Review its overview, metrics, official resources, and model leaderboard results on DataLearnerAI.
Related resources
Latest SWE-bench Multilingual model rankings and full benchmark leaderboard
Browse the latest scores, model modes, release dates, and parameter sizes for SWE-bench Multilingual.
Source: DataLearnerAI
Data sourced primarily from official releases (GitHub, Hugging Face, papers), then benchmark leaderboards, then third-party evaluators. Learn about our data methodology
Model Mode Legend
License:
Origin:
Model release cutoff:
SWE-bench Multilingual Rank
| Rank | Model | License | |||
|---|---|---|---|---|---|
![]() Claude Mythos Preview Extended ThinkingTools | 87.30 | 2026-04-07 | Unknown | Closed | |
![]() Kimi K2.6 Thinking EnabledTools | 76.70 | 2026-04-20 | 1000B | Free Commercial | |
![]() DeepSeek-V4-Pro Thinking Level · Extra HighTools | 76.20 | 2026-04-24 | 1600B | Free Commercial | |
4 | ![]() DeepSeek-V4-Pro Thinking Level · HighTools | 74.10 | 2026-04-24 | 1600B | Free Commercial |
5 | ![]() Qwen 3.6 Plus Preview Thinking Enabled | 73.80 | 2026-03-31 | Unknown | Closed |
6 | ![]() Composer 2 Thinking Enabled | 73.70 | 2026-03-19 | Unknown | Closed |
7 | ![]() DeepSeek-V4-Flash Thinking Level · Extra HighTools | 73.30 | 2026-04-24 | 284B | Free Commercial |
8 | ![]() Kimi K2.5 Thinking Enabled | 73.00 | 2026-01-27 | 1000B | Free Commercial |
9 | ![]() Claude Opus 4.6 Extended ThinkingTools | 72.00 | 2026-02-05 | Unknown | Closed |
10 | ![]() Qwen3.6-27B Thinking EnabledTools | 71.30 | 2026-04-22 | 27B | Free Commercial |
11 | ![]() DeepSeek-V4-Flash Thinking Level · HighTools | 70.20 | 2026-04-24 | 284B | Free Commercial |
12 | ![]() DeepSeek-V4-Pro Standard ModeTools | 69.80 | 2026-04-24 | 1600B | Free Commercial |
13 | ![]() DeepSeek-V4-Flash Standard ModeTools | 69.70 | 2026-04-24 | 284B | Free Commercial |
14 | ![]() Qwen3.5-397B-A17B Thinking Enabled | 69.30 | 2026-02-16 | 39.7B | Free Commercial |
15 | ![]() Qwen3.6-35B-A3B Thinking Enabled | 67.20 | 2026-04-16 | 35B | Free Commercial |
16 | ![]() Composer 1.5 Thinking Enabled | 65.90 | 2026-02-09 | Unknown | Closed |
17 | ![]() Composer 1 Thinking Enabled | 56.90 | 2025-10-29 | Unknown | Closed |




