What does guidance do? A fine-grained analysis in a simple setting
Muthu Chidambaram, Khashayar Gatmiry, Sitan Chen, Holden Lee, Jianfeng, Lu

TL;DR
This paper rigorously analyzes the effects of guidance in diffusion models, revealing that guidance does not produce the intended distribution and that large guidance distorts samples, with implications for practical use.
Contribution
It provides a detailed theoretical characterization of guidance dynamics in simple settings, clarifying misconceptions and offering practical insights.
Findings
Guidance causes samples to concentrate near the support boundary as guidance increases.
Large guidance levels lead to samples moving away from the true data support.
Theoretical results are validated through experiments on synthetic data.
Abstract
The use of guidance in diffusion models was originally motivated by the premise that the guidance-modified score is that of the data distribution tilted by a conditional likelihood raised to some power. In this work we clarify this misconception by rigorously proving that guidance fails to sample from the intended tilted distribution. Our main result is to give a fine-grained characterization of the dynamics of guidance in two cases, (1) mixtures of compactly supported distributions and (2) mixtures of Gaussians, which reflect salient properties of guidance that manifest on real-world data. In both cases, we prove that as the guidance parameter increases, the guided model samples more heavily from the boundary of the support of the conditional distribution. We also prove that for any nonzero level of score estimation error, sufficiently large guidance will result in sampling away from…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpatial Cognition and Navigation
MethodsDiffusion
