Use Cases - swebench

Researchers can use SWE-bench to test how well their large language models generate patches that fix real GitHub issues.

Developers can leverage the SWE-smith dataset and SWE-bench subsets to train models on realistic issue resolution scenarios.