Fisher Information guided Purification against Backdoor Attacks
Nazmul Karim, Abdullah Al Arafat, Adnan Siraj Rakin, Zhishan Guo and, Nazanin Rahnavard

TL;DR
This paper introduces Fisher Information guided Purification (FIP), a novel framework for effectively removing backdoors from deep neural networks by promoting smoother minima, with an efficient variant called Fast FIP that significantly reduces runtime.
Contribution
The paper proposes a new backdoor purification method based on Fisher Information, introducing regularizers that suppress backdoors while preserving clean data knowledge, and presents an efficient variant with substantial runtime improvements.
Findings
Achieves state-of-the-art results across diverse tasks and datasets.
Effectively removes backdoors with minimal impact on clean accuracy.
Fast FIP reduces runtime by approximately 5 times.
Abstract
Studies on backdoor attacks in recent years suggest that an adversary can compromise the integrity of a deep neural network (DNN) by manipulating a small set of training samples. Our analysis shows that such manipulation can make the backdoor model converge to a bad local minima, i.e., sharper minima as compared to a benign model. Intuitively, the backdoor can be purified by re-optimizing the model to smoother minima. However, a na\"ive adoption of any optimization targeting smoother minima can lead to sub-optimal purification techniques hampering the clean test accuracy. Hence, to effectively obtain such re-optimization, inspired by our novel perspective establishing the connection between backdoor removal and loss smoothness, we propose Fisher Information guided Purification (FIP), a novel backdoor purification framework. Proposed FIP consists of a couple of novel regularizers that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNetwork Security and Intrusion Detection · Advanced Malware Detection Techniques
MethodsSparse Evolutionary Training
