Assessing the Latent Automated Program Repair Capabilities of Large Language Models using Round-Trip Translation
Fernando Vallecillos Ruiz, Anastasiia Grishina, Max Hort, Leon Moonen

TL;DR
This paper investigates the use of round-trip translation with large language models to improve automated program repair, showing promising results but also highlighting limitations and potential as a complementary approach.
Contribution
It introduces the novel application of round-trip translation for automated program repair using LLMs, demonstrating its effectiveness and analyzing its limitations.
Findings
RTT generates plausible patches for 100/164 bugs with GPT-4.
97 patches were manually verified as correct.
RTT can find patches missed by fine-tuned LLMs.
Abstract
Research shows that errors in natural language can be corrected by translating texts to another language and back using language models. We explore to what extent this latent correction capability extends to Automated Program Repair (APR) by investigating Round-Trip Translation (RTT): translating code from one programming language into another programming or natural language and back, using Large Language Models (LLMs). We hypothesize that RTT restores patterns most commonly seen in the LLM's training corpora through regression toward the mean, replacing infrequent bugs with more frequent, natural, bug-free code. To test this hypothesis, we employ nine LLMs and four common APR benchmarks in Java, and perform a detailed quantitative and qualitative analysis of RTT-generated patches. We find that RTT through English generates plausible patches for 100 of 164 bugs with GPT-4 on the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Software Reliability and Analysis Research · Software System Performance and Reliability
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · Attention Is All You Need · Label Smoothing · Absolute Position Encodings · Linear Layer · Dropout · Adam · Cosine Annealing · Dense Connections
