Latent Diffusion Inversion Requires Understanding the Latent Space
Mingxing Rao, Bowen Qu, Daniel Moyer

TL;DR
This paper investigates how latent diffusion models memorize training data, revealing non-uniform and dimension-specific overfitting, and proposes a method to improve privacy by removing less-memorizing latent dimensions.
Contribution
It uncovers the uneven memorization patterns in latent diffusion models and introduces a ranking method to identify and mitigate dimensions contributing to memorization.
Findings
Memorization varies across latent codes and dimensions.
Removing less-memorizing dimensions improves membership inference performance.
Auto-encoder geometry significantly influences model memorization.
Abstract
The recovery of training data from generative models ("model inversion") has been extensively studied for diffusion models in the data domain as a memorization/overfitting phenomenon. Latent diffusion models (LDMs), which operate on the latent codes from encoder/decoder pairs, have been robust to prior inversion methods. In this work we describe two key findings: (1) the diffusion model exhibits non-uniform memorization across latent codes, tending to overfit samples located in high-distortion regions of the decoder pullback metric; (2) even within a single latent code, memorization contributions are unequal across representation dimensions. Our proposed method to ranks latent dimensions by their contribution to the decoder pullback metric, which in turn identifies dimensions that contribute to memorization. For score-based membership inference, a sub-task of model inversion, we find…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Adversarial Robustness in Machine Learning · Advanced Malware Detection Techniques
