Mitigating Backdoor Attack by Injecting Proactive Defensive Backdoor
Shaokui Wei, Hongyuan Zha, Baoyuan Wu

TL;DR
This paper introduces PDB, a proactive defense method that injects a secret defensive backdoor during training to neutralize malicious backdoors, achieving state-of-the-art security without compromising model utility.
Contribution
The paper proposes a novel proactive defense approach called PDB that injects a secret defensive backdoor during training to mitigate malicious backdoor attacks.
Findings
PDB effectively suppresses malicious backdoors across various datasets.
PDB maintains high model utility while defending against backdoor attacks.
Experimental results outperform existing detection and removal methods.
Abstract
Data-poisoning backdoor attacks are serious security threats to machine learning models, where an adversary can manipulate the training dataset to inject backdoors into models. In this paper, we focus on in-training backdoor defense, aiming to train a clean model even when the dataset may be potentially poisoned. Unlike most existing methods that primarily detect and remove/unlearn suspicious samples to mitigate malicious backdoor attacks, we propose a novel defense approach called PDB (Proactive Defensive Backdoor). Specifically, PDB leverages the home-field advantage of defenders by proactively injecting a defensive backdoor into the model during training. Taking advantage of controlling the training process, the defensive backdoor is designed to suppress the malicious backdoor effectively while remaining secret to attackers. In addition, we introduce a reversible mapping to determine…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Malware Detection Techniques · Cryptographic Implementations and Security · Security and Verification in Computing
MethodsFocus
