Let us Build Bridges: Understanding and Extending Diffusion Generative Models
Xingchao Liu, Lemeng Wu, Mao Ye, Qiang Liu

TL;DR
This paper re-examines diffusion generative models to enhance theoretical understanding and develop versatile algorithms applicable across various data domains, demonstrating improved performance on images, segments, and 3D point clouds.
Contribution
It introduces a unified framework viewing diffusion models as latent variable models, providing theoretical error analysis and methods for diverse data types.
Findings
Effective diffusion bridge processes for deterministic endpoint constraints
First theoretical error analysis for diffusion model learning
Superior performance on images, semantic segments, and 3D point clouds
Abstract
Diffusion-based generative models have achieved promising results recently, but raise an array of open questions in terms of conceptual understanding, theoretical analysis, algorithm improvement and extensions to discrete, structured, non-Euclidean domains. This work tries to re-exam the overall framework, in order to gain better theoretical understandings and develop algorithmic extensions for data from arbitrary domains. By viewing diffusion models as latent variable models with unobserved diffusion trajectories and applying maximum likelihood estimation (MLE) with latent trajectories imputed from an auxiliary distribution, we show that both the model construction and the imputation of latent trajectories amount to constructing diffusion bridge processes that achieve deterministic values and constraints at end point, for which we provide a systematic study and a suit of tools.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Model Reduction and Neural Networks · Gaussian Processes and Bayesian Inference
MethodsDiffusion
