Targeted Data Protection for Diffusion Model by Matching Training Trajectory
Hojun Lee, Mijin Koo, Yeji Song, Nojun Kwak

TL;DR
This paper introduces TAFAP, a novel method for targeted data protection in diffusion models, enabling persistent, controllable, and verifiable transformations during training to safeguard against unauthorized data use.
Contribution
TAFAP is the first approach to control entire training trajectories for effective, stable targeted data protection in diffusion models, surpassing snapshot-based methods.
Findings
TAFAP achieves successful targeted transformations in diffusion models.
The method maintains high image quality while redirecting towards target concepts.
TAFAP outperforms existing TDP methods in controllability and robustness.
Abstract
Recent advancements in diffusion models have made fine-tuning text-to-image models for personalization increasingly accessible, but have also raised significant concerns regarding unauthorized data usage and privacy infringement. Current protection methods are limited to passively degrading image quality, failing to achieve stable control. While Targeted Data Protection (TDP) offers a promising paradigm for active redirection toward user-specified target concepts, existing TDP attempts suffer from poor controllability due to snapshot-matching approaches that fail to account for complete learning dynamics. We introduce TAFAP (Trajectory Alignment via Fine-tuning with Adversarial Perturbations), the first method to successfully achieve effective TDP by controlling the entire training trajectory. Unlike snapshot-based methods whose protective influence is easily diluted as training…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Adversarial Robustness in Machine Learning · Face recognition and analysis
