Model Integrity when Unlearning with T2I Diffusion Models
Andrea Schioppa, Emiel Hoogeboom, Jonathan Heek

TL;DR
This paper examines the impact of machine unlearning algorithms on the integrity of text-to-image diffusion models, introducing a new metric and algorithms that better preserve original model capabilities after unlearning specific data.
Contribution
It introduces a novel retention metric and improved unlearning algorithms that better maintain model integrity in diffusion models after unlearning specific data.
Findings
New retention metric effectively measures model integrity post-unlearning.
Proposed algorithms outperform existing methods in preserving original generation quality.
Algorithms are simple to implement and serve as benchmarks for future research.
Abstract
The rapid advancement of text-to-image Diffusion Models has led to their widespread public accessibility. However these models, trained on large internet datasets, can sometimes generate undesirable outputs. To mitigate this, approximate Machine Unlearning algorithms have been proposed to modify model weights to reduce the generation of specific types of images, characterized by samples from a ``forget distribution'', while preserving the model's ability to generate other images, characterized by samples from a ``retain distribution''. While these methods aim to minimize the influence of training data in the forget distribution without extensive additional computation, we point out that they can compromise the model's integrity by inadvertently affecting generation for images in the retain distribution. Recognizing the limitations of FID and CLIPScore in capturing these effects, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel Reduction and Neural Networks · Neural Networks and Applications · Simulation Techniques and Applications
MethodsDiffusion
