Kernel-Smoothed Scores for Denoising Diffusion: A Bias-Variance Study

Franck Gabriel; Fran\c{c}ois Ged; Maria Han Veiga; Emmanuel Schertzer

arXiv:2505.22841·cs.LG·May 30, 2025

Kernel-Smoothed Scores for Denoising Diffusion: A Bias-Variance Study

Franck Gabriel, Fran\c{c}ois Ged, Maria Han Veiga, Emmanuel Schertzer

PDF

Open Access

TL;DR

This paper investigates the bias-variance trade-off in diffusion models' scores, proposing kernel smoothing to reduce variance, prevent memorization, and improve generative quality, supported by theoretical bounds and experiments.

Contribution

Introduces a kernel-smoothed empirical score for diffusion models, analyzing its bias-variance trade-off and demonstrating its effectiveness in reducing memorization and enhancing generalization.

Findings

01

Kernel smoothing reduces score variance and memorization.

02

Regularization via kernel smoothing improves generative diversity.

03

Theoretical bounds relate smoothing to distribution divergence.

Abstract

Diffusion models now set the benchmark in high-fidelity generative sampling, yet they can, in principle, be prone to memorization. In this case, their learned score overfits the finite dataset so that the reverse-time SDE samples are mostly training points. In this paper, we interpret the empirical score as a noisy version of the true score and show that its covariance matrix is asymptotically a re-weighted data PCA. In large dimension, the small time limit makes the noise variance blow up while simultaneously reducing spatial correlation. To reduce this variance, we introduce a kernel-smoothed empirical score and analyze its bias-variance trade-off. We derive asymptotic bounds on the Kullback-Leibler divergence between the true distribution and the one generated by the modified reverse SDE. Regularization on the score has the same effect as increasing the size of the training dataset,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Generative Adversarial Networks and Image Synthesis · Advanced Neuroimaging Techniques and Applications

MethodsPrincipal Components Analysis · Diffusion · Sparse Evolutionary Training