Beyond Memorization: Selective Learning for Copyright-Safe Diffusion Model Training

Divya Kothandaraman; Jaclyn Pytlarz

arXiv:2512.11194·cs.LG·January 28, 2026

Beyond Memorization: Selective Learning for Copyright-Safe Diffusion Model Training

Divya Kothandaraman, Jaclyn Pytlarz

PDF

Open Access

TL;DR

This paper presents a gradient projection technique for diffusion models that selectively excludes sensitive features during training, significantly reducing memorization of proprietary data while maintaining high-quality image generation.

Contribution

The authors introduce a novel gradient projection method for concept-level feature exclusion, enhancing privacy and IP safety in diffusion model training without sacrificing performance.

Findings

01

Reduces memorization of sensitive features by over 90%

02

Maintains image quality and semantic fidelity

03

Seamlessly integrates with existing training pipelines

Abstract

Memorization in large-scale text-to-image diffusion models poses significant security and intellectual property risks, enabling adversarial attribute extraction and the unauthorized reproduction of sensitive or proprietary features. While conventional dememorization techniques, such as regularization and data filtering, limit overfitting to specific training examples, they fail to systematically prevent the internalization of prohibited concept-level features. Simply discarding all images containing a sensitive feature wastes invaluable training data, necessitating a method for selective learning at the concept level. We introduce a gradient projection method designed to enforce a stringent requirement of concept-level feature exclusion. Our defense operates during backpropagation by systematically identifying and excising training signals aligned with embeddings of prohibited…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Generative Adversarial Networks and Image Synthesis · Advanced Malware Detection Techniques