Finding NeMo: Localizing Neurons Responsible For Memorization in   Diffusion Models

Dominik Hintersdorf; Lukas Struppek; Kristian Kersting; Adam Dziedzic,; Franziska Boenisch

arXiv:2406.02366·cs.LG·November 5, 2024·2 cites

Finding NeMo: Localizing Neurons Responsible For Memorization in Diffusion Models

Dominik Hintersdorf, Lukas Struppek, Kristian Kersting, Adam Dziedzic,, Franziska Boenisch

PDF

Open Access 1 Repo

TL;DR

This paper introduces NeMo, a method to identify and deactivate specific neurons in diffusion models responsible for memorizing training data, thereby reducing privacy risks and improving output diversity.

Contribution

NeMo is the first approach to localize memorization in diffusion models at the neuron level, enabling targeted mitigation of data leakage.

Findings

01

Single neurons often memorize specific training samples.

02

Deactivating memorization neurons reduces data leakage.

03

Mitigates privacy and copyright concerns in diffusion models.

Abstract

Diffusion models (DMs) produce very detailed and high-quality images. Their power results from extensive training on large amounts of data, usually scraped from the internet without proper attribution or consent from content creators. Unfortunately, this practice raises privacy and intellectual property concerns, as DMs can memorize and later reproduce their potentially sensitive or copyrighted training images at inference time. Prior efforts prevent this issue by either changing the input to the diffusion process, thereby preventing the DM from generating memorized samples during inference, or removing the memorized data from training altogether. While those are viable solutions when the DM is developed and deployed in a secure and constantly monitored environment, they hold the risk of adversaries circumventing the safeguards and are not effective when the DM itself is publicly…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ml-research/localizing_memorization_in_diffusion_models
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications

MethodsDiffusion