Invisible Clean-Label Backdoor Attacks for Generative Data Augmentation
Ting Xiang, Jinhui Zhao, Changjian Chen, Zhuo Tang

TL;DR
This paper introduces InvLBA, a novel invisible clean-label backdoor attack targeting generative data augmentation, which significantly improves attack success rates while maintaining model accuracy and robustness.
Contribution
We propose InvLBA, a latent feature-level backdoor attack method for generative augmentation, with theoretical guarantees and superior empirical performance over pixel-level approaches.
Findings
InvLBA achieves a 46.43% higher attack success rate on average.
The method maintains almost no reduction in clean accuracy.
InvLBA demonstrates high robustness against state-of-the-art defenses.
Abstract
With the rapid advancement of image generative models, generative data augmentation has become an effective way to enrich training images, especially when only small-scale datasets are available. At the same time, in practical applications, generative data augmentation can be vulnerable to clean-label backdoor attacks, which aim to bypass human inspection. However, based on theoretical analysis and preliminary experiments, we observe that directly applying existing pixel-level clean-label backdoor attack methods (e.g., COMBAT) to generated images results in low attack success rates. This motivates us to move beyond pixel-level triggers and focus instead on the latent feature level. To this end, we propose InvLBA, an invisible clean-label backdoor attack method for generative data augmentation by latent perturbation. We theoretically prove that the generalization of the clean accuracy…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Generative Adversarial Networks and Image Synthesis · Image Enhancement Techniques
