PAD-FT: A Lightweight Defense for Backdoor Attacks via Data Purification and Fine-Tuning
Yukai Xu, Yujie Gu, Kouichi Sakurai

TL;DR
This paper introduces PAD-FT, a lightweight, efficient defense against backdoor attacks on neural networks that does not require extra clean data and only fine-tunes a small part of the model.
Contribution
It proposes a novel data purification and fine-tuning method that is computationally efficient and effective against various backdoor attack methods without needing additional datasets.
Findings
Effective against multiple backdoor attack methods
Does not require an additional clean dataset
Fine-tunes only the last classification layer
Abstract
Backdoor attacks pose a significant threat to deep neural networks, particularly as recent advancements have led to increasingly subtle implantation, making the defense more challenging. Existing defense mechanisms typically rely on an additional clean dataset as a standard reference and involve retraining an auxiliary model or fine-tuning the entire victim model. However, these approaches are often computationally expensive and not always feasible in practical applications. In this paper, we propose a novel and lightweight defense mechanism, termed PAD-FT, that does not require an additional clean dataset and fine-tunes only a very small part of the model to disinfect the victim model. To achieve this, our approach first introduces a simple data purification process to identify and select the most-likely clean data from the poisoned training dataset. The self-purified clean dataset is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Malware Detection Techniques · Network Security and Intrusion Detection · Cryptographic Implementations and Security
