Denoising Score Matching with Random Features: Insights on Diffusion Models from Precise Learning Curves

Anand Jerry George; Rodrigo Veiga; Nicolas Macris

arXiv:2502.00336·cs.LG·October 9, 2025

Denoising Score Matching with Random Features: Insights on Diffusion Models from Precise Learning Curves

Anand Jerry George, Rodrigo Veiga, Nicolas Macris

PDF

Open Access

TL;DR

This paper provides a theoretical analysis of diffusion models trained with Denoising Score Matching, revealing how model complexity, data size, and noise samples influence generalization and memorization, supported by precise error expressions.

Contribution

It introduces asymptotically exact formulas for test and train errors in diffusion models with random feature parameterization, elucidating their generalization behaviors.

Findings

01

Test and train errors depend on data and feature ratios.

02

Regimes of generalization and memorization are characterized.

03

Theoretical results align with empirical observations.

Abstract

We theoretically investigate the phenomena of generalization and memorization in diffusion models. Empirical studies suggest that these phenomena are influenced by model complexity and the size of the training dataset. In our experiments, we further observe that the number of noise samples per data sample ( $m$ ) used during Denoising Score Matching (DSM) plays a significant and non-trivial role. We capture these behaviors and shed insights into their mechanisms by deriving asymptotically precise expressions for test and train errors of DSM under a simple theoretical setting. The score function is parameterized by random features neural networks, with the target distribution being $d$ -dimensional Gaussian. We operate in a regime where the dimension $d$ , number of data samples $n$ , and number of features $p$ tend to infinity while keeping the ratios $ψ_{n} = \frac{n}{d}$ and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications