WaveAttack: Asymmetric Frequency Obfuscation-based Backdoor Attacks Against Deep Neural Networks
Jun Xia, Zhihao Yue, Yingbo Zhou, Zhiwei Ling, Xian Wei, Mingsong Chen

TL;DR
WaveAttack introduces a novel frequency-based backdoor attack leveraging wavelet transforms and asymmetric obfuscation to improve stealthiness and effectiveness against deep neural networks, outperforming existing methods in image fidelity.
Contribution
This paper presents WaveAttack, a new backdoor attack method using frequency domain features and asymmetric obfuscation to enhance stealth and attack success.
Findings
Achieves higher stealthiness and effectiveness than state-of-the-art methods.
Improves image fidelity metrics significantly, with up to 28.27% PSNR increase.
Reduces Inception Score (IS) by 70.59%.
Abstract
Due to the popularity of Artificial Intelligence (AI) technology, numerous backdoor attacks are designed by adversaries to mislead deep neural network predictions by manipulating training samples and training processes. Although backdoor attacks are effective in various real scenarios, they still suffer from the problems of both low fidelity of poisoned samples and non-negligible transfer in latent space, which make them easily detectable by existing backdoor detection algorithms. To overcome the weakness, this paper proposes a novel frequency-based backdoor attack method named WaveAttack, which obtains image high-frequency features through Discrete Wavelet Transform (DWT) to generate backdoor triggers. Furthermore, we introduce an asymmetric frequency obfuscation method, which can add an adaptive residual in the training and inference stage to improve the impact of triggers and further…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Digital Media Forensic Detection
