APPT: Boosting Automated Patch Correctness Prediction via Fine-tuning Pre-trained Models
Quanjun Zhang, Chunrong Fang, Weisong Sun, Yan Liu, Tieke He, Xiaodong, Hao, Zhenyu Chen

TL;DR
This paper introduces APPT, a novel approach that fine-tunes pre-trained models for automated patch correctness prediction, significantly improving accuracy over existing methods by fully leveraging pre-trained representations.
Contribution
The paper proposes a fine-tuning framework for pre-trained models in patch correctness prediction, enhancing representation quality and overall prediction performance.
Findings
APPT achieves 79.7% accuracy and 83.2% recall, outperforming previous methods.
Fine-tuning and LSTM components significantly improve prediction performance.
Advanced pre-trained models further enhance the generalizability and effectiveness of APPT.
Abstract
Automated program repair (APR) aims to fix software bugs automatically without human debugging efforts and plays a crucial role in software development and maintenance. Despite promising, APR is still challenged by a long-standing overfitting problem (i.e., the generated patch is plausible but overfitting). Various techniques have thus been proposed to address the overfitting problem. Recently, researchers have employed BERT to extract code features, which are then used to train a classifier for patch correctness prediction. However, BERT is restricted to feature extraction for classifier training without benefiting from the training process, potentially generating sub-optimal vector representations for patched code snippets. In this paper, we propose APPT, a pre-trained model-based automated patch correctness assessment technique by both pre-training and fine-tuning. APPT adopts a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare · AI in cancer detection · Machine Learning and Data Classification
