SWE-Bench Pro - Commercial

1,708 views

Problem Count: 858
Institution: Scale AI
Category: Coding and Software Engineering
Metrics: Accuracy
Language: English
Difficulty: Mixed

Overview

A commercial benchmark dataset for evaluating whether models can solve realistic, complex software-engineering tasks.

Related resources

View Paper
Get Dataset
Official Website
DataLearner Blog

Latest SWE-Bench Pro - Commercial model rankings and full benchmark leaderboard

Browse the latest scores, model modes, release dates, and parameter sizes for SWE-Bench Pro - Commercial.

Source: DataLearnerAI

Data sourced primarily from official releases (GitHub, Hugging Face, papers), then benchmark leaderboards, then third-party evaluators. Learn about our data methodology

Model Mode Legend

License:

Origin:

Model release cutoff:

No benchmark data available yet