Conditional Image Synthesis with Diffusion Models: A Survey
Zheyuan Zhan, Defang Chen, Jian-Ping Mei, Zhenghe Zhao, Jiawei Chen, Chun Chen, Siwei Lyu, Can Wang

TL;DR
This survey reviews recent advances in diffusion-based models for conditional image synthesis, categorizing conditioning mechanisms, discussing core principles, and highlighting challenges and future directions.
Contribution
It provides a comprehensive categorization and analysis of conditioning methods in diffusion models, clarifying their principles, advantages, and challenges in conditional image synthesis.
Findings
Categorization of conditioning mechanisms during training and sampling
Analysis of principles and advantages of different conditioning approaches
Identification of unresolved challenges and future research directions
Abstract
Conditional image synthesis based on user-specified requirements is a key component in creating complex visual content. In recent years, diffusion-based generative modeling has become a highly effective way for conditional image synthesis, leading to exponential growth in the literature. However, the complexity of diffusion-based modeling, the wide range of image synthesis tasks, and the diversity of conditioning mechanisms present significant challenges for researchers to keep up with rapid developments and to understand the core concepts on this topic. In this survey, we categorize existing works based on how conditions are integrated into the two fundamental components of diffusion-based modeling, , the denoising network and the sampling process. We specifically highlight the underlying principles, advantages, and potential challenges of various conditioning approaches…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Image Retrieval and Classification Techniques · Advanced Vision and Imaging
