Efficient Diffusion Models for Vision: A Survey
Anwaar Ulhaq, Naveed Akhtar

TL;DR
This survey reviews recent advances in diffusion models for vision, emphasizing design strategies that improve computational efficiency to make these models more practical and accessible for real-world applications.
Contribution
It uniquely focuses on the design choices that enhance efficiency in diffusion models for vision, guiding future research towards more practical implementations.
Findings
Highlighting recent efficient diffusion model designs
Analyzing the impact of design choices on computational cost
Providing future outlooks for efficient diffusion models
Abstract
Diffusion Models (DMs) have demonstrated state-of-the-art performance in content generation without requiring adversarial training. These models are trained using a two-step process. First, a forward - diffusion - process gradually adds noise to a datum (usually an image). Then, a backward - reverse diffusion - process gradually removes the noise to turn it into a sample of the target distribution being modelled. DMs are inspired by non-equilibrium thermodynamics and have inherent high computational complexity. Due to the frequent function evaluations and gradient calculations in high-dimensional spaces, these models incur considerable computational overhead during both training and inference stages. This can not only preclude the democratization of diffusion-based modelling, but also hinder the adaption of diffusion models in real-life applications. Not to mention, the efficiency of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFractional Differential Equations Solutions · Mathematical and Theoretical Epidemiology and Ecology Models · Domain Adaptation and Few-Shot Learning
MethodsDiffusion
