Diffusion Model's Generalization Can Be Characterized by Inductive Biases toward a Data-Dependent Ridge Manifold

Ye He; Yitong Qiu; Molei Tao

arXiv:2602.06021·stat.ML·May 14, 2026

Diffusion Model's Generalization Can Be Characterized by Inductive Biases toward a Data-Dependent Ridge Manifold

Ye He, Yitong Qiu, Molei Tao

PDF

TL;DR

This paper characterizes the generalization of diffusion models by analyzing their geometric behavior relative to data-dependent ridge manifolds, revealing a reach-align-slide mechanism during sample generation.

Contribution

It introduces a novel geometric framework using time-dependent ridge manifolds to understand diffusion model generalization and connects it to training dynamics, supported by experiments.

Findings

01

Generated samples first approach the ridge manifold

02

Distance to the ridge is influenced by training error

03

Motion along the ridge is governed by learned error components

Abstract

We study a data-dependent notion of diffusion-model generalization: when a model does not memorize the training set, where do its generated samples go relative to the geometry induced by the data? To answer this, we introduce a time-dependent family of log-density ridge manifolds constructed from the smoothed empirical distribution, and use it to characterize reverse-time inference. Our main result shows that generated samples evolve by a reach-align-slide mechanism: they first enter a neighborhood of the ridge, then their distance to the ridge is controlled by the normal component of training error, and finally their motion along the ridge is controlled by the tangential component. We further connect this geometric picture to training dynamics through directional decompositions of the learned error, and make this link explicit for random feature models, where architectural bias and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.