Learning to Generate Corrective Patches using Neural Machine Translation

Hideaki Hata; Emad Shihab; Graham Neubig

arXiv:1812.07170·cs.SE·July 5, 2019·37 cites

Learning to Generate Corrective Patches using Neural Machine Translation

Hideaki Hata, Emad Shihab, Graham Neubig

PDF

Open Access 1 Repo

TL;DR

This paper introduces Ratchet, a neural machine translation system that learns from past bug fixes to generate corrective patches, demonstrating high syntactic validity and potential usefulness in automated bug repair.

Contribution

The paper presents Ratchet, a novel neural sequence-to-sequence model for automated bug fix generation based on learning from historical code corrections.

Findings

01

Ratchet generates syntactically valid code 98.7% of the time.

02

F1-measure ranges from 0.29 to 0.83 in fixing bugs.

03

Participants found Ratchet's suggestions helpful for bug fixing.

Abstract

Bug fixing is generally a manually-intensive task. However, recent work has proposed the idea of automated program repair, which aims to repair (at least a subset of) bugs in different ways such as code mutation, etc. Following in the same line of work as automated bug repair, in this paper we aim to leverage past fixes to propose fixes of current/future bugs. Specifically, we propose Ratchet, a corrective patch generation system using neural machine translation. By learning corresponding pre-correction and post-correction code in past fixes with a neural sequence-to-sequence model, Ratchet is able to generate a fix code for a given bug-prone code query. We perform an empirical study with five open source projects, namely Ambari, Camel, Hadoop, Jetty and Wicket, to evaluate the effectiveness of Ratchet. Our findings show that Ratchet can generate syntactically valid statements 98.7% of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

hideakihata/nmtbasedcorrectivepatchgenerationdataset
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSoftware Engineering Research · Software Testing and Debugging Techniques · Advanced Malware Detection Techniques