Variance reduction of diffusion model's gradients with Taylor   approximation-based control variate

Paul Jeha; Will Grathwohl; Michael Riis Andersen; Carl Henrik Ek; Jes; Frellsen

arXiv:2408.12270·cs.LG·August 23, 2024

Variance reduction of diffusion model's gradients with Taylor approximation-based control variate

Paul Jeha, Will Grathwohl, Michael Riis Andersen, Carl Henrik Ek, Jes, Frellsen

PDF

Open Access

TL;DR

This paper introduces a Taylor approximation-based control variate to reduce gradient variance in diffusion models, improving training stability and efficiency for high-dimensional data generation.

Contribution

It proposes a novel control variate derived from Taylor expansion, with theoretical proof of equivalence and empirical validation on both low and high-dimensional problems.

Findings

01

Significant variance reduction in gradients.

02

Improved training stability in diffusion models.

03

Effective on both low and high-dimensional data.

Abstract

Score-based models, trained with denoising score matching, are remarkably effective in generating high dimensional data. However, the high variance of their training objective hinders optimisation. We attempt to reduce it with a control variate, derived via a $k$ -th order Taylor expansion on the training objective and its gradient. We prove an equivalence between the two and demonstrate empirically the effectiveness of our approach on a low dimensional problem setting; and study its effect on larger problems.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Mathematical Modeling in Engineering