Beyond Memorization: Selective Learning for Copyright-Safe Diffusion Model Training
Divya Kothandaraman, Jaclyn Pytlarz

TL;DR
This paper presents a gradient projection technique for diffusion models that selectively excludes sensitive features during training, significantly reducing memorization of proprietary data while maintaining high-quality image generation.
Contribution
The authors introduce a novel gradient projection method for concept-level feature exclusion, enhancing privacy and IP safety in diffusion model training without sacrificing performance.
Findings
Reduces memorization of sensitive features by over 90%
Maintains image quality and semantic fidelity
Seamlessly integrates with existing training pipelines
Abstract
Memorization in large-scale text-to-image diffusion models poses significant security and intellectual property risks, enabling adversarial attribute extraction and the unauthorized reproduction of sensitive or proprietary features. While conventional dememorization techniques, such as regularization and data filtering, limit overfitting to specific training examples, they fail to systematically prevent the internalization of prohibited concept-level features. Simply discarding all images containing a sensitive feature wastes invaluable training data, necessitating a method for selective learning at the concept level. We introduce a gradient projection method designed to enforce a stringent requirement of concept-level feature exclusion. Our defense operates during backpropagation by systematically identifying and excising training signals aligned with embeddings of prohibited…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Generative Adversarial Networks and Image Synthesis · Advanced Malware Detection Techniques
