Diffusion Models and Representation Learning: A Survey
Michael Fuest, Pingchuan Ma, Ming Gui, Johannes Schusterbauer, Vincent, Tao Hu, Bjorn Ommer

TL;DR
This survey reviews the relationship between diffusion models and representation learning, highlighting their mathematical foundations, architectures, and methods to leverage learned representations for recognition tasks.
Contribution
It provides a comprehensive taxonomy and analysis of how diffusion models are integrated with representation learning, including recent advancements and future research directions.
Findings
Diffusion models serve as self-supervised learning methods in vision tasks.
Various frameworks utilize pre-trained diffusion models for recognition.
Advancements in representation learning enhance diffusion model capabilities.
Abstract
Diffusion Models are popular generative modeling methods in various vision tasks, attracting significant attention. They can be considered a unique instance of self-supervised learning methods due to their independence from label annotation. This survey explores the interplay between diffusion models and representation learning. It provides an overview of diffusion models' essential aspects, including mathematical foundations, popular denoising network architectures, and guidance methods. Various approaches related to diffusion models and representation learning are detailed. These include frameworks that leverage representations learned from pre-trained diffusion models for subsequent recognition tasks and methods that utilize advancements in representation and self-supervised learning to enhance diffusion models. This survey aims to offer a comprehensive overview of the taxonomy…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare · Natural Language Processing Techniques · Speech Recognition and Synthesis
MethodsDiffusion
