TL;DR
This paper introduces Quatrain, a novel approach that uses natural language processing to assess patch correctness by evaluating the semantic correlation between bug reports and patch descriptions, achieving high accuracy on large datasets.
Contribution
The paper presents Quatrain, a new neural network-based method that correlates bug reports and patch descriptions to evaluate patch correctness, transforming the problem into a question-answering task.
Findings
Achieves an AUC of 0.886 in predicting patch correctness.
Recalls 93% of correct patches while filtering out 62% of incorrect ones.
Demonstrates the importance of input quality on prediction performance.
Abstract
In this work, we propose a novel perspective to the problem of patch correctness assessment: a correct patch implements changes that "answer" to a problem posed by buggy behaviour. Concretely, we turn the patch correctness assessment into a Question Answering problem. To tackle this problem, our intuition is that natural language processing can provide the necessary representations and models for assessing the semantic correlation between a bug (question) and a patch (answer). Specifically, we consider as inputs the bug reports as well as the natural language description of the generated patches. Our approach, Quatrain, first considers state of the art commit message generation models to produce the relevant inputs associated to each generated patch. Then we leverage a neural network architecture to learn the semantic correlation between bug reports and commit messages. Experiments on a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
