Mitigating Backdoor Attack by Injecting Proactive Defensive Backdoor

Shaokui Wei; Hongyuan Zha; Baoyuan Wu

arXiv:2405.16112 · cs.CR · October 16, 2024 · 2 cites

Open Access · 2 Repos

TL;DR

This paper introduces PDB (Proactive Defensive Backdoor), an in-training defense that injects a secret defensive backdoor into the model to suppress malicious backdoors. In the reported experiments it outperforms existing detection and removal methods while maintaining model utility.

Contribution

The paper proposes a novel proactive defense approach called PDB that injects a secret defensive backdoor during training to mitigate malicious backdoor attacks.

Findings

01 PDB effectively suppresses malicious backdoors across various datasets.

02 PDB maintains high model utility while defending against backdoor attacks.

03 In experiments, PDB outperforms existing detection and removal methods.

Abstract

Data-poisoning backdoor attacks are serious security threats to machine learning models, where an adversary can manipulate the training dataset to inject backdoors into models. In this paper, we focus on in-training backdoor defense, aiming to train a clean model even when the dataset may be potentially poisoned. Unlike most existing methods that primarily detect and remove/unlearn suspicious samples to mitigate malicious backdoor attacks, we propose a novel defense approach called PDB (Proactive Defensive Backdoor). Specifically, PDB leverages the home-field advantage of defenders by proactively injecting a defensive backdoor into the model during training. Taking advantage of controlling the training process, the defensive backdoor is designed to suppress the malicious backdoor effectively while remaining secret to attackers. In addition, we introduce a reversible mapping to determine…
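The idea in the abstract can be illustrated with a minimal sketch. The abstract is truncated before it says what the reversible mapping determines, so everything concrete here is an assumption made for illustration: the corner-patch trigger, the cyclic label shift used as the reversible mapping, and the helper names (`apply_defensive_trigger`, `map_label`, `unmap_label`, `build_defensive_set`, `predict_with_defense`) are hypothetical and not taken from the paper.

```python
import numpy as np

NUM_CLASSES = 10  # hypothetical number of classes


def apply_defensive_trigger(images, patch_value=1.0, patch_size=3):
    """Stamp a square patch in the top-left corner (hypothetical trigger)."""
    stamped = images.copy()  # leave the caller's array untouched
    stamped[:, :patch_size, :patch_size] = patch_value
    return stamped


def map_label(labels, num_classes=NUM_CLASSES):
    """Assumed reversible mapping: cyclic shift by one class."""
    return (labels + 1) % num_classes


def unmap_label(labels, num_classes=NUM_CLASSES):
    """Inverse of map_label, so unmap_label(map_label(y)) == y."""
    return (labels - 1) % num_classes


def build_defensive_set(images, labels):
    """Create defensively poisoned samples to mix into training.

    The defender stamps the secret trigger on samples and relabels them
    through the reversible mapping, so the model learns
    'trigger present -> mapped label'.
    """
    return apply_defensive_trigger(images), map_label(labels)


def predict_with_defense(model_predict, images):
    """Stamp the trigger at inference time, then invert the mapping.

    model_predict is any callable returning integer class predictions.
    """
    shifted = model_predict(apply_defensive_trigger(images))
    return unmap_label(shifted)
```

Under these assumptions, the defender trains on the original data plus the output of `build_defensive_set`, and serves predictions through `predict_with_defense`. Because the mapping is invertible, the original label can be recovered from the prediction on a triggered input.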

Taxonomy

Topics: Advanced Malware Detection Techniques · Cryptographic Implementations and Security · Security and Verification in Computing
