Towards Prompt-robust Face Privacy Protection via Adversarial Decoupling Augmentation Framework
Ruijia Wu, Yuhang Wang, Huafeng Shi, Zhipeng Yu, Yichao Wu, Ding Liang

TL;DR
This paper introduces ADAF, a novel framework that enhances facial privacy protection in text-to-image models by targeting the image-text fusion module and employing multi-level augmentations for robustness against various prompts.
Contribution
The paper proposes ADAF, a new adversarial decoupling augmentation framework that improves privacy protection by focusing on the image-text fusion module and increasing defense stability.
Findings
ADAF outperforms existing algorithms on CelebA-HQ and VGGFace2 datasets.
The framework effectively enhances robustness against diverse attacker prompts.
Extensive experiments validate ADAF's superior performance.
Abstract
Denoising diffusion models have shown remarkable potential in various generation tasks. The open-source large-scale text-to-image model, Stable Diffusion, becomes prevalent as it can generate realistic artistic or facial images with personalization through fine-tuning on a limited number of new samples. However, this has raised privacy concerns as adversaries can acquire facial images online and fine-tune text-to-image models for malicious editing, leading to baseless scandals, defamation, and disruption to victims' lives. Prior research efforts have focused on deriving adversarial loss from conventional training processes for facial privacy protection through adversarial perturbations. However, existing algorithms face two issues: 1) they neglect the image-text fusion module, which is the vital module of text-to-image diffusion models, and 2) their defensive performance is unstable…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Adversarial Robustness in Machine Learning · Face recognition and analysis
MethodsDiffusion
