Enhancing Automated Program Repair through Fine-tuning and Prompt Engineering
Rishov Paul, Md. Mohib Hossain, Mohammed Latif Siddiq, Masum Hasan, Anindya Iqbal, and Joanna C. S. Santos

TL;DR
This paper explores how fine-tuning large language models with code review data and prompt engineering can enhance automated program repair, but practical challenges remain in deploying these models effectively.
Contribution
It demonstrates that pre-trained models fine-tuned with review and code changes outperform previous models, and assesses prompt engineering with LLMs for program repair.
Findings
Fine-tuned models outperform previous approaches.
Prompt engineering with LLMs shows potential but practical use is challenging.
Manual analysis indicates current models are not yet ready for deployment.
Abstract
Sequence-to-sequence models have been used to transform erroneous programs into correct ones when trained with a large enough dataset. Some recent studies also demonstrated strong empirical evidence that code review could improve the program repair further. Large language models, trained with Natural Language (NL) and Programming Language (PL), can contain inherent knowledge of both. In this study, we investigate if this inherent knowledge of PL and NL can be utilized to improve automated program repair. We applied PLBART and CodeT5, two state-of-the-art language models that are pre-trained with both PL and NL, on two such natural language-based program repair datasets and found that the pre-trained language models fine-tuned with datasets containing both code review and subsequent code changes notably outperformed each of the previous models. With the advent of code generative models…
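To make the fine-tuning setup in the abstract more concrete, here is a minimal sketch of how one training example containing a code review comment and the subsequent code change might be turned into a source/target pair for a sequence-to-sequence model. The abstract does not specify the input format, so the `RepairExample` class, the `to_seq2seq_pair` helper, and the `</s>` separator are all hypothetical choices made for illustration, not the authors' method.

```python
from dataclasses import dataclass
from typing import Tuple

# Hypothetical separator between the review comment and the code.
# The format actually used in the paper is not stated in this summary.
SEP = " </s> "


@dataclass
class RepairExample:
    """One review-driven repair example (field names are illustrative)."""
    review_comment: str
    buggy_code: str
    fixed_code: str


def normalize(text: str) -> str:
    """Collapse all runs of whitespace into single spaces."""
    return " ".join(text.split())


def to_seq2seq_pair(example: RepairExample, sep: str = SEP) -> Tuple[str, str]:
    """Build (source, target) strings for sequence-to-sequence fine-tuning.

    The source concatenates the review comment (NL) and the code before
    the change (PL); the target is the code after the change.
    """
    source = normalize(example.review_comment) + sep + normalize(example.buggy_code)
    target = normalize(example.fixed_code)
    return source, target


example = RepairExample(
    review_comment="Use equals() for string comparison.",
    buggy_code="if (a == b) {\n    return true;\n}",
    fixed_code="if (a.equals(b)) {\n    return true;\n}",
)
source, target = to_seq2seq_pair(example)
```

Pairs built this way could then be tokenized and passed to a pre-trained encoder-decoder model such as PLBART or CodeT5; the tokenization and training loop are omitted here because their details are not given in this summary.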
Taxonomy
Topics: Software Testing and Debugging Techniques · Advanced Malware Detection Techniques · Software Engineering Research
Methods: Multi-Head Attention · Attention Is All You Need · Repair · Adam · Cosine Annealing · Weight Decay · Linear Warmup With Cosine Annealing
