Assessing the Latent Automated Program Repair Capabilities of Large Language Models using Round-Trip Translation

Fernando Vallecillos Ruiz; Anastasiia Grishina; Max Hort; Leon Moonen

arXiv:2401.07994·cs.SE·October 16, 2025·1 cites

Assessing the Latent Automated Program Repair Capabilities of Large Language Models using Round-Trip Translation

Fernando Vallecillos Ruiz, Anastasiia Grishina, Max Hort, Leon Moonen

PDF

Open Access 1 Repo

TL;DR

This paper investigates the use of round-trip translation with large language models to improve automated program repair, showing promising results but also highlighting limitations and potential as a complementary approach.

Contribution

It introduces the novel application of round-trip translation for automated program repair using LLMs, demonstrating its effectiveness and analyzing its limitations.

Findings

01

RTT generates plausible patches for 100/164 bugs with GPT-4.

02

97 patches were manually verified as correct.

03

RTT can find patches missed by fine-tuned LLMs.

Abstract

Research shows that errors in natural language can be corrected by translating texts to another language and back using language models. We explore to what extent this latent correction capability extends to Automated Program Repair (APR) by investigating Round-Trip Translation (RTT): translating code from one programming language into another programming or natural language and back, using Large Language Models (LLMs). We hypothesize that RTT restores patterns most commonly seen in the LLM's training corpora through regression toward the mean, replacing infrequent bugs with more frequent, natural, bug-free code. To test this hypothesis, we employ nine LLMs and four common APR benchmarks in Java, and perform a detailed quantitative and qualitative analysis of RTT-generated patches. We find that RTT through English generates plausible patches for 100 of 164 bugs with GPT-4 on the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

https://zenodo.org/record/10500594
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSoftware Engineering Research · Software Reliability and Analysis Research · Software System Performance and Reliability

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · Attention Is All You Need · Label Smoothing · Absolute Position Encodings · Linear Layer · Dropout · Adam · Cosine Annealing · Dense Connections