Exploring Parameter-Efficient Fine-Tuning of Large Language Model on Automated Program Repair
Guochang Li, Chen Zhi, Jialiang Chen, Junxiao Han, Shuiguang Deng

TL;DR
This paper investigates parameter-efficient fine-tuning (PEFT) of large language models for automated program repair, demonstrating improved bug fixing performance and resource efficiency compared to full-model fine-tuning.
Contribution
It introduces an instruction dataset for APR, compares four PEFT methods, and analyzes hyperparameters, memory usage, and dataset size effects on performance.
Findings
PEFT improves bug fixing effectiveness over state-of-the-art methods.
$(IA)^3$ enhances LLM creativity and achieves top performance among PEFT methods.
Larger datasets and more parameters do not always lead to better PEFT results.
Abstract
Automated Program Repair (APR) aims to fix bugs by generating patches. And existing work has demonstrated that "pre-training and fine-tuning" paradigm enables Large Language Models (LLMs) improve fixing capabilities on APR. However, existing work mainly focuses on Full-Model Fine-Tuning (FMFT) for APR and limited research has been conducted on the execution-based evaluation of Parameter-Efficient Fine-Tuning (PEFT) for APR. Comparing to FMFT, PEFT can reduce computing resource consumption without compromising performance and has been widely adopted to other software engineering tasks. To fill this gap, we enhance the existing APR dataset by employing prompt engineering to create an instruction dataset, APR-INSTRUCTION, at first. Secondly, we fine-tune four pre-trained LLMs using four different PEFT methods with APR-INSTRUCTION. The best fine-tuned model fixes 58% more bugs than the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques · Advanced Data Storage Technologies · Radiation Effects in Electronics
