Diffusion Models Generate Images Like Painters: an Analytical Theory of Outline First, Details Later
Binxu Wang, John J. Vastola

TL;DR
This paper provides an analytical theory explaining how diffusion models generate images, revealing that the process involves low-dimensional rotations and a progression from outlines to details, akin to painting.
Contribution
The authors derive a closed-form solution to the probability flow ODE, offering a new theoretical understanding of diffusion-based image generation and its similarities to GANs and painting.
Findings
Reverse diffusion resembles low-dimensional rotations.
High-variance features emerge earlier in generation.
Early perturbations significantly influence final images.
Abstract
How do diffusion generative models convert pure noise into meaningful images? In a variety of pretrained diffusion models (including conditional latent space models like Stable Diffusion), we observe that the reverse diffusion process that underlies image generation has the following properties: (i) individual trajectories tend to be low-dimensional and resemble 2D `rotations'; (ii) high-variance scene features like layout tend to emerge earlier, while low-variance details tend to emerge later; and (iii) early perturbations tend to have a greater impact on image content than later perturbations. To understand these phenomena, we derive and study a closed-form solution to the probability flow ODE for a Gaussian distribution, which shows that the reverse diffusion state rotates towards a gradually-specified target on the image manifold. It also shows that generation involves first…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Retrieval and Classification Techniques · Generative Adversarial Networks and Image Synthesis · Aesthetic Perception and Analysis
MethodsDiffusion
