Invisible Backdoor Attack with Sample-Specific Triggers

Yuezun Li; Yiming Li; Baoyuan Wu; Longkang Li; Ran He; and Siwei Lyu

arXiv:2012.03816·cs.CR·August 16, 2021

Invisible Backdoor Attack with Sample-Specific Triggers

Yuezun Li, Yiming Li, Baoyuan Wu, Longkang Li, Ran He, and Siwei Lyu

PDF

1 Repo

TL;DR

This paper introduces a novel backdoor attack where triggers are sample-specific and invisible, making it harder to detect and defend against, by encoding triggers into images using steganography techniques.

Contribution

The work proposes a new sample-specific, invisible trigger generation method for backdoor attacks, enhancing attack stealth and effectiveness against existing defenses.

Findings

01

Effective attack on DNNs with sample-specific triggers

02

Triggers are invisible and hard to detect

03

Attack remains successful against defenses

Abstract

Recently, backdoor attacks pose a new security threat to the training process of deep neural networks (DNNs). Attackers intend to inject hidden backdoors into DNNs, such that the attacked model performs well on benign samples, whereas its prediction will be maliciously changed if hidden backdoors are activated by the attacker-defined trigger. Existing backdoor attacks usually adopt the setting that triggers are sample-agnostic, $i . e .,$ different poisoned samples contain the same trigger, resulting in that the attacks could be easily mitigated by current backdoor defenses. In this work, we explore a novel attack paradigm, where backdoor triggers are sample-specific. In our attack, we only need to modify certain training samples with invisible perturbation, while not need to manipulate other training components ( $e . g .$ , training loss, and model structure) as required in many existing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

yuezunli/issba
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.