Manifold Generalization Provably Proceeds Memorization in Diffusion Models
Zebang Shen, Ya-Ping Hsieh, Niao He

TL;DR
This paper explains how diffusion models can generate novel samples by leveraging the geometry of data manifolds, achieving faster generalization rates than traditional density estimation, especially with smooth manifolds.
Contribution
It provides a theoretical analysis showing diffusion models trained with coarse scores exploit manifold geometry for efficient generalization, surpassing classical density estimation rates.
Findings
Diffusion models can generalize by capturing data geometry rather than full distribution.
Coarse scores enable near-parametric rates of convergence on manifold support.
Faster generalization occurs when the data manifold is sufficiently smooth.
Abstract
Diffusion models often generate novel samples even when the learned score is only \emph{coarse} -- a phenomenon not accounted for by the standard view of diffusion training as density estimation. In this paper, we show that, under the \emph{manifold hypothesis}, this behavior can instead be explained by coarse scores capturing the \emph{geometry} of the data while discarding the fine-scale distributional structure of the population measure~. Concretely, whereas estimating the full data distribution supported on a -dimensional manifold is known to require the classical minimax rate , we prove that diffusion models trained with coarse scores can exploit the \emph{regularity of the manifold support} and attain a near-parametric rate toward a \emph{different} target distribution.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Markov Chains and Monte Carlo Methods · Generative Adversarial Networks and Image Synthesis
