Imperceptible and Multi-channel Backdoor Attack against Deep Neural Networks
Mingfu Xue, Shifeng Ni, Yinghao Wu, Yushu Zhang, Jian Wang, Weiqiang, Liu

TL;DR
This paper introduces a novel, imperceptible multi-channel backdoor attack on DNNs using DCT steganography, achieving high success rates without affecting model accuracy and resisting existing defenses.
Contribution
It proposes a new multi-channel, multi-target backdoor attack method leveraging DCT steganography, enhancing stealth and robustness against defenses.
Findings
Achieves over 90% attack success rate on CIFAR-10 and TinyImageNet datasets.
Maintains classification accuracy despite the backdoor attack.
Resists state-of-the-art backdoor defense methods like Neural Cleanse.
Abstract
Recent researches demonstrate that Deep Neural Networks (DNN) models are vulnerable to backdoor attacks. The backdoored DNN model will behave maliciously when images containing backdoor triggers arrive. To date, existing backdoor attacks are single-trigger and single-target attacks, and the triggers of most existing backdoor attacks are obvious thus are easy to be detected or noticed. In this paper, we propose a novel imperceptible and multi-channel backdoor attack against Deep Neural Networks by exploiting Discrete Cosine Transform (DCT) steganography. Based on the proposed backdoor attack method, we implement two variants of backdoor attacks, i.e., N-to-N backdoor attack and N-to-One backdoor attack. Specifically, for a colored image, we utilize DCT steganography to construct the trigger on different channels of the image. As a result, the trigger is stealthy and natural. Based on the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Digital Media Forensic Detection · Generative Adversarial Networks and Image Synthesis
MethodsDiscrete Cosine Transform
