M-to-N Backdoor Paradigm: A Multi-Trigger and Multi-Target Attack to Deep Learning Models
Linshan Hou, Zhongyun Hua, Yuhong Li, Yifeng Zheng, Leo Yu, Zhang

TL;DR
This paper introduces an M-to-N backdoor attack paradigm that enables manipulation of inputs to target multiple classes with multiple triggers, significantly enhancing attack effectiveness and robustness against defenses.
Contribution
It proposes a novel M-to-N attack framework allowing multiple triggers per target class, with an invisible trigger generation method that improves attack success and robustness.
Findings
Effective multi-target class attacks demonstrated
Robust against pre-processing and defenses
Increases attack versatility and stealth
Abstract
Deep neural networks (DNNs) are vulnerable to backdoor attacks, where a backdoored model behaves normally with clean inputs but exhibits attacker-specified behaviors upon the inputs containing triggers. Most previous backdoor attacks mainly focus on either the all-to-one or all-to-all paradigm, allowing attackers to manipulate an input to attack a single target class. Besides, the two paradigms rely on a single trigger for backdoor activation, rendering attacks ineffective if the trigger is destroyed. In light of the above, we propose a new -to- attack paradigm that allows an attacker to manipulate any input to attack target classes, and each backdoor of the target classes can be activated by any one of its triggers. Our attack selects clean images from each target class as triggers and leverages our proposed poisoned image generation framework to inject the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications
