Key strength: Includes a large and diverse set of 1,865 real-world software engineering tasks from 41 repositories.
Top feature: Extensive Task Set
Best for: AI Agent Evaluation
Pricing: open-source
Quick start: Install Docker
Quick reference
Key strength: Includes a large and diverse set of 1,865 real-world software engineering tasks from 41 repositories.
Top feature: Extensive Task Set
Best for: AI Agent Evaluation
Pricing: open-source
Quick start: Install Docker