Using Developer Discussions to Guide Fixing Bugs in Software
Sheena Panthaplackel, Milos Gligoric, Junyi Jessy Li, Raymond J., Mooney

TL;DR
This paper explores leveraging bug report discussions as natural language context to improve automated bug fixing, demonstrating that discussions available before fixing can enhance model performance over traditional commit message-based approaches.
Contribution
It introduces a novel dataset augmentation method using bug report discussions and shows that this context improves bug-fixing models' effectiveness.
Findings
Discussions improve bug-fixing accuracy.
Natural language context from discussions outperforms commit messages.
Augmented datasets enhance model performance.
Abstract
Automatically fixing software bugs is a challenging task. While recent work showed that natural language context is useful in guiding bug-fixing models, the approach required prompting developers to provide this context, which was simulated through commit messages written after the bug-fixing code changes were made. We instead propose using bug report discussions, which are available before the task is performed and are also naturally occurring, avoiding the need for any additional information from developers. For this, we augment standard bug-fixing datasets with bug report discussions. Using these newly compiled datasets, we demonstrate that various forms of natural language context derived from such discussions can aid bug-fixing, even leading to improved performance over using commit messages corresponding to the oracle bug-fixing commits.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Software Reliability and Analysis Research · Software Testing and Debugging Techniques
