Unconsciously Forget: Mitigating Memorization; Without Knowing What is being Memorized
Er Jin, Yang Zhang, Yongli Mou, Yanfei Dong, Stefan Decker, Kenji Kawaguchi, Johannes Stegmaier

TL;DR
This paper introduces UniForget, a model pruning technique that reduces memorization of copyrighted content in generative models without needing to identify specific concepts, thereby enhancing privacy and legal compliance.
Contribution
UniForget provides a novel, scalable approach to mitigate memorization by pruning responsible model parts, complementing existing unlearning methods without high computational costs.
Findings
Pruning specific model parts reduces copyrighted content generation.
The method preserves overall generative quality.
It is orthogonal and complementary to existing unlearning techniques.
Abstract
Recent advances in generative models have demonstrated an exceptional ability to produce highly realistic images. However, previous studies show that generated images often resemble the training data, and this problem becomes more severe as the model size increases. Memorizing training data can lead to legal challenges, including copyright infringement, violations of portrait rights, and trademark violations. Existing approaches to mitigating memorization mainly focus on manipulating the denoising sampling process to steer image embeddings away from the memorized embedding space or employ unlearning methods that require training on datasets containing specific sets of memorized concepts. However, existing methods often incur substantial computational overhead during sampling, or focus narrowly on removing one or more groups of target concepts, imposing a significant limitation on their…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Handwritten Text Recognition Techniques · Multimodal Machine Learning Applications
