Kernel-Smoothed Scores for Denoising Diffusion: A Bias-Variance Study
Franck Gabriel, Fran\c{c}ois Ged, Maria Han Veiga, Emmanuel Schertzer

TL;DR
This paper investigates the bias-variance trade-off in diffusion models' scores, proposing kernel smoothing to reduce variance, prevent memorization, and improve generative quality, supported by theoretical bounds and experiments.
Contribution
Introduces a kernel-smoothed empirical score for diffusion models, analyzing its bias-variance trade-off and demonstrating its effectiveness in reducing memorization and enhancing generalization.
Findings
Kernel smoothing reduces score variance and memorization.
Regularization via kernel smoothing improves generative diversity.
Theoretical bounds relate smoothing to distribution divergence.
Abstract
Diffusion models now set the benchmark in high-fidelity generative sampling, yet they can, in principle, be prone to memorization. In this case, their learned score overfits the finite dataset so that the reverse-time SDE samples are mostly training points. In this paper, we interpret the empirical score as a noisy version of the true score and show that its covariance matrix is asymptotically a re-weighted data PCA. In large dimension, the small time limit makes the noise variance blow up while simultaneously reducing spatial correlation. To reduce this variance, we introduce a kernel-smoothed empirical score and analyze its bias-variance trade-off. We derive asymptotic bounds on the Kullback-Leibler divergence between the true distribution and the one generated by the modified reverse SDE. Regularization on the score has the same effect as increasing the size of the training dataset,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Generative Adversarial Networks and Image Synthesis · Advanced Neuroimaging Techniques and Applications
MethodsPrincipal Components Analysis · Diffusion · Sparse Evolutionary Training
