Manifold Generalization Provably Proceeds Memorization in Diffusion Models

Zebang Shen; Ya-Ping Hsieh; Niao He

arXiv:2603.23792·cs.LG·March 26, 2026

Manifold Generalization Provably Proceeds Memorization in Diffusion Models

Zebang Shen, Ya-Ping Hsieh, Niao He

PDF

Open Access

TL;DR

This paper explains how diffusion models can generate novel samples by leveraging the geometry of data manifolds, achieving faster generalization rates than traditional density estimation, especially with smooth manifolds.

Contribution

It provides a theoretical analysis showing diffusion models trained with coarse scores exploit manifold geometry for efficient generalization, surpassing classical density estimation rates.

Findings

01

Diffusion models can generalize by capturing data geometry rather than full distribution.

02

Coarse scores enable near-parametric rates of convergence on manifold support.

03

Faster generalization occurs when the data manifold is sufficiently smooth.

Abstract

Diffusion models often generate novel samples even when the learned score is only \emph{coarse} -- a phenomenon not accounted for by the standard view of diffusion training as density estimation. In this paper, we show that, under the \emph{manifold hypothesis}, this behavior can instead be explained by coarse scores capturing the \emph{geometry} of the data while discarding the fine-scale distributional structure of the population measure~ $μ_{data}$ . Concretely, whereas estimating the full data distribution $μ_{data}$ supported on a $k$ -dimensional manifold is known to require the classical minimax rate $\tilde{O} (N^{- 1/ k})$ , we prove that diffusion models trained with coarse scores can exploit the \emph{regularity of the manifold support} and attain a near-parametric rate toward a \emph{different} target distribution.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Markov Chains and Monte Carlo Methods · Generative Adversarial Networks and Image Synthesis