Fragile by Design: On the Limits of Adversarial Defenses in Personalized Generation

Zhen Chen; Yi Zhang; Xiangyu Yin; Chengxuan Qin; Xingyu Zhao; Xiaowei Huang; Wenjie Ruan

arXiv:2511.10382·cs.CV·November 14, 2025

Fragile by Design: On the Limits of Adversarial Defenses in Personalized Generation

Zhen Chen, Yi Zhang, Xiangyu Yin, Chengxuan Qin, Xingyu Zhao, Xiaowei Huang, Wenjie Ruan

PDF

Open Access

TL;DR

This paper critically examines the limitations of current adversarial defenses in personalized AI content generation, revealing their fragility and perceptibility issues, and introduces a framework to evaluate their robustness against realistic purification threats.

Contribution

The paper identifies key vulnerabilities in existing defenses like Anti-DreamBooth and proposes a systematic evaluation framework, AntiDB_Purify, to assess their robustness under realistic purification attacks.

Findings

01

Current defenses are easily detectable due to perceptible artifacts.

02

Perturbations are fragile and can be removed by simple filters.

03

Existing methods fail under realistic purification threats.

Abstract

Personalized AI applications such as DreamBooth enable the generation of customized content from user images, but also raise significant privacy concerns, particularly the risk of facial identity leakage. Recent defense mechanisms like Anti-DreamBooth attempt to mitigate this risk by injecting adversarial perturbations into user photos to prevent successful personalization. However, we identify two critical yet overlooked limitations of these methods. First, the adversarial examples often exhibit perceptible artifacts such as conspicuous patterns or stripes, making them easily detectable as manipulated content. Second, the perturbations are highly fragile, as even a simple, non-learned filter can effectively remove them, thereby restoring the model's ability to memorize and reproduce user identity. To investigate this vulnerability, we propose a novel evaluation framework,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Advanced Malware Detection Techniques · User Authentication and Security Systems