Resolving Memorization in Empirical Diffusion Model for Manifold Data in High-Dimensional Spaces
Yang Lyu, Tan Minh Nguyen, Yuchun Qian, Xin T. Tong

TL;DR
This paper introduces a simple inertial update to empirical diffusion models that effectively reduces memorization, enabling the generation of diverse, high-quality samples on manifolds without additional training.
Contribution
The work demonstrates that an inertia update at the end of diffusion simulation can eliminate memorization, with theoretical guarantees on distribution approximation independent of ambient space dimension.
Findings
Inertial diffusion reduces memorization in empirical models.
Sample distribution approximates true data on manifolds within a quantifiable Wasserstein distance.
The approach connects diffusion models with manifold learning techniques.
Abstract
Diffusion models are popular tools for generating new data samples, using a forward process that adds noise to data and a reverse process to denoise and produce samples. However, when the data distribution consists of n points, empirical diffusion models tend to reproduce existing data points, a phenomenon known as the memorization effect. Current literature often addresses this with complex machine learning techniques. This work shows that the memorization issue can be solved simply by applying an inertia update at the end of the empirical diffusion simulation. Our inertial diffusion model requires only the empirical score function and no additional training. We demonstrate that the distribution of samples from this model approximates the true data distribution on a manifold of dimension , within a Wasserstein-1 distance of order . This bound…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMorphological variations and asymmetry · Advanced Neuroimaging Techniques and Applications · Topological and Geometric Data Analysis
MethodsDiffusion
