SWE-Dev: Building Software Engineering Agents with Training and Inference Scaling
Haoran Wang, Zhenyu Hou, Yao Wei, Jie Tang, Yuxiao Dong

TL;DR
SWE-Dev introduces a scalable approach to building software engineering agents using open-source LLMs, with a new data synthesis pipeline and benchmark results showing top performance among open models.
Contribution
The paper presents SWE-Dev, a novel framework that scales training data for SWE agents and demonstrates superior open-source model performance.
Findings
SWE-Dev 7B achieves 23.4% success rate.
SWE-Dev 32B achieves 36.6% success rate.
Outperforms existing open-source SWE agents.
Abstract
Large language models (LLMs) have advanced rapidly from conversational problem solving to addressing real-world tasks involving tool use, such as software engineering (SWE). Recent LLM-powered toolkits, such as OpenAI Codex and Cursor, have offered end-to-end automation of the software development process. However, building effective SWE agents remains challenging due to the lack of high-quality training data and effective test cases. To address this issue, we present SWE-Dev, an SWE agent built upon open-source LLMs. First, we develop a robust pipeline to synthesize test cases for patch evaluation. Second, we scale up agent trajectories to construct the training data for building SWE-Dev. Experiments on the SWE-bench-Verified benchmark show that the SWE-Dev models can achieve top performance among all open SWE agents. Specifically, the success rates of the SWE-Dev 7B and 32B parameter…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗zai-org/SWE-Dev-7Bmodel· 32 dl· ♡ 632 dl♡ 6
- 🤗zai-org/SWE-Dev-32Bmodel· 18 dl· ♡ 3018 dl♡ 30
- 🤗zai-org/SWE-Dev-9Bmodel· 9 dl· ♡ 129 dl♡ 12
- 🤗Mungert/SWE-Dev-32B-GGUFmodel· 250 dl· ♡ 5250 dl♡ 5
- 🤗Mungert/SWE-Dev-7B-GGUFmodel· 94 dl· ♡ 594 dl♡ 5
- 🤗target919/affine-sw-test-1-5DV5SWR7BXRfQTRRTGsBhEu7aJVXKb1TF7kYfG9o1L3jNi9imodel· 4 dl4 dl
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Techniques and Practices · Software Engineering Research · Topic Modeling
