Fisher Information guided Purification against Backdoor Attacks

Nazmul Karim; Abdullah Al Arafat; Adnan Siraj Rakin; Zhishan Guo and; Nazanin Rahnavard

arXiv:2409.00863·cs.CV·September 4, 2024

Fisher Information guided Purification against Backdoor Attacks

Nazmul Karim, Abdullah Al Arafat, Adnan Siraj Rakin, Zhishan Guo and, Nazanin Rahnavard

PDF

Open Access 1 Repo

TL;DR

This paper introduces Fisher Information guided Purification (FIP), a novel framework for effectively removing backdoors from deep neural networks by promoting smoother minima, with an efficient variant called Fast FIP that significantly reduces runtime.

Contribution

The paper proposes a new backdoor purification method based on Fisher Information, introducing regularizers that suppress backdoors while preserving clean data knowledge, and presents an efficient variant with substantial runtime improvements.

Findings

01

Achieves state-of-the-art results across diverse tasks and datasets.

02

Effectively removes backdoors with minimal impact on clean accuracy.

03

Fast FIP reduces runtime by approximately 5 times.

Abstract

Studies on backdoor attacks in recent years suggest that an adversary can compromise the integrity of a deep neural network (DNN) by manipulating a small set of training samples. Our analysis shows that such manipulation can make the backdoor model converge to a bad local minima, i.e., sharper minima as compared to a benign model. Intuitively, the backdoor can be purified by re-optimizing the model to smoother minima. However, a na\"ive adoption of any optimization targeting smoother minima can lead to sub-optimal purification techniques hampering the clean test accuracy. Hence, to effectively obtain such re-optimization, inspired by our novel perspective establishing the connection between backdoor removal and loss smoothness, we propose Fisher Information guided Purification (FIP), a novel backdoor purification framework. Proposed FIP consists of a couple of novel regularizers that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

nazmul-karim170/FIP-Fisher-Backdoor-Removal
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNetwork Security and Intrusion Detection · Advanced Malware Detection Techniques

MethodsSparse Evolutionary Training