APPT: Boosting Automated Patch Correctness Prediction via Fine-tuning   Pre-trained Models

Quanjun Zhang; Chunrong Fang; Weisong Sun; Yan Liu; Tieke He; Xiaodong; Hao; Zhenyu Chen

arXiv:2301.12453·cs.SE·January 17, 2024·5 cites

APPT: Boosting Automated Patch Correctness Prediction via Fine-tuning Pre-trained Models

Quanjun Zhang, Chunrong Fang, Weisong Sun, Yan Liu, Tieke He, Xiaodong, Hao, Zhenyu Chen

PDF

Open Access 1 Repo

TL;DR

This paper introduces APPT, a novel approach that fine-tunes pre-trained models for automated patch correctness prediction, significantly improving accuracy over existing methods by fully leveraging pre-trained representations.

Contribution

The paper proposes a fine-tuning framework for pre-trained models in patch correctness prediction, enhancing representation quality and overall prediction performance.

Findings

01

APPT achieves 79.7% accuracy and 83.2% recall, outperforming previous methods.

02

Fine-tuning and LSTM components significantly improve prediction performance.

03

Advanced pre-trained models further enhance the generalizability and effectiveness of APPT.

Abstract

Automated program repair (APR) aims to fix software bugs automatically without human debugging efforts and plays a crucial role in software development and maintenance. Despite promising, APR is still challenged by a long-standing overfitting problem (i.e., the generated patch is plausible but overfitting). Various techniques have thus been proposed to address the overfitting problem. Recently, researchers have employed BERT to extract code features, which are then used to train a classifier for patch correctness prediction. However, BERT is restricted to feature extraction for classifier training without benefiting from the training process, potentially generating sub-optimal vector representations for patched code snippets. In this paper, we propose APPT, a pre-trained model-based automated patch correctness assessment technique by both pre-training and fine-tuning. APPT adopts a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

isenglab/appt
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning in Healthcare · AI in cancer detection · Machine Learning and Data Classification