Creating training data for software engineering agents is difficult. Until now.
Introducing SWE-smith: Generate 100s to 1000s of task instances for any GitHub repository.
We’ve generated 50k+ task instances for 128 popular GitHub repositories, then trained our own LM for SWE-agent.