A Survey on Generative Diffusion Model
Hanqun Cao, Cheng Tan, Zhangyang Gao, Yilun Xu, Guangyong Chen,, Pheng-Ann Heng, and Stan Z. Li

TL;DR
This survey comprehensively reviews the development, algorithms, and applications of diffusion models, highlighting their role in advancing generative AI across various domains and future research directions.
Contribution
It provides an in-depth, structured overview of diffusion models' evolution, algorithmic improvements, and diverse applications, offering valuable insights for future research.
Findings
Diffusion models are pivotal in generative AI across multiple domains.
The survey details recent algorithmic enhancements and their impact.
Future directions include improving efficiency and expanding applications.
Abstract
Deep generative models have unlocked another profound realm of human creativity. By capturing and generalizing patterns within data, we have entered the epoch of all-encompassing Artificial Intelligence for General Creativity (AIGC). Notably, diffusion models, recognized as one of the paramount generative models, materialize human ideation into tangible instances across diverse domains, encompassing imagery, text, speech, biology, and healthcare. To provide advanced and comprehensive insights into diffusion, this survey comprehensively elucidates its developmental trajectory and future directions from three distinct angles: the fundamental formulation of diffusion, algorithmic enhancements, and the manifold applications of diffusion. Each layer is meticulously explored to offer a profound comprehension of its evolution. Structured and summarized approaches are presented in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCreativity in Education and Neuroscience · Private Equity and Venture Capital · Artificial Intelligence in Healthcare and Education
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Diffusion
