Fragile by Design: On the Limits of Adversarial Defenses in Personalized Generation
Zhen Chen, Yi Zhang, Xiangyu Yin, Chengxuan Qin, Xingyu Zhao, Xiaowei Huang, Wenjie Ruan

TL;DR
This paper critically examines the limitations of current adversarial defenses in personalized AI content generation, revealing their fragility and perceptibility issues, and introduces a framework to evaluate their robustness against realistic purification threats.
Contribution
The paper identifies key vulnerabilities in existing defenses like Anti-DreamBooth and proposes a systematic evaluation framework, AntiDB_Purify, to assess their robustness under realistic purification attacks.
Findings
Current defenses are easily detectable due to perceptible artifacts.
Perturbations are fragile and can be removed by simple filters.
Existing methods fail under realistic purification threats.
Abstract
Personalized AI applications such as DreamBooth enable the generation of customized content from user images, but also raise significant privacy concerns, particularly the risk of facial identity leakage. Recent defense mechanisms like Anti-DreamBooth attempt to mitigate this risk by injecting adversarial perturbations into user photos to prevent successful personalization. However, we identify two critical yet overlooked limitations of these methods. First, the adversarial examples often exhibit perceptible artifacts such as conspicuous patterns or stripes, making them easily detectable as manipulated content. Second, the perturbations are highly fragile, as even a simple, non-learned filter can effectively remove them, thereby restoring the model's ability to memorize and reproduce user identity. To investigate this vulnerability, we propose a novel evaluation framework,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Advanced Malware Detection Techniques · User Authentication and Security Systems
