BugPilot: Complex Bug Generation for Efficient Learning of SWE Skills
Atharv Sonwane, Isadora White, Hyunji Lee, Matheus Pereira, Lucas Caccia, Minseon Kim, Zhengyan Shi, Chinmay Singh, Alessandro Sordoni, Marc-Alexandre C\^ot\'e, Xingdi Yuan

TL;DR
This paper presents BugPilot, a novel synthetic bug generation method that creates realistic, diverse bugs to improve training of SWE agents, leading to better performance with less data.
Contribution
Introducing BugPilot, a new bug generation approach that produces more realistic bugs, enhancing supervised learning efficiency for SWE models.
Findings
Generated bugs outperform existing datasets in training efficiency.
Models trained on BugPilot bugs achieve state-of-the-art results.
Synthetic bugs closely mimic human-authored code edits.
Abstract
High quality bugs are key to training the next generation of language model based software engineering (SWE) agents. We introduce a novel method for synthetic generation of difficult and diverse bugs. Our method instructs SWE Agents to introduce a feature into the codebase whereby they may unintentionally break tests, resulting in bugs. Prior approaches often induce an out-of-distribution effect by generating bugs intentionally (e.g. by introducing local perturbation to existing code), which does not reflect realistic development processes. We perform qualitative analysis to demonstrate that our approach for generating bugs more closely reflects the patterns found in human-authored edits. Through extensive experiments, we demonstrate that our bugs provide more efficient training data for supervised fine-tuning, outperforming other bug datasets by 2% with half the training data (1.2k vs.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗microsoft/FrogBoss-32B-2510model· 6.3k dl· ♡ 296.3k dl♡ 29
- 🤗microsoft/FrogMini-14B-2510model· 142 dl· ♡ 61142 dl♡ 61
- 🤗cyankiwi/FrogMini-14B-2510-AWQ-4bitmodel· 7 dl7 dl
- 🤗cyankiwi/FrogMini-14B-2510-AWQ-8bitmodel· 1 dl1 dl
- 🤗cyankiwi/FrogBoss-32B-2510-AWQ-4bitmodel· 5 dl5 dl
- 🤗cyankiwi/FrogBoss-32B-2510-AWQ-8bitmodel· 1 dl1 dl
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Software Testing and Debugging Techniques · Topic Modeling
