Diffusion Model Quantization: A Review
Qian Zeng, Chenggong Hu, Mingli Song, Jie Song

TL;DR
This survey reviews recent advancements in diffusion model quantization, analyzing techniques, benchmarking results, and challenges to facilitate efficient deployment of generative models on resource-limited devices.
Contribution
It provides a comprehensive taxonomy, qualitative and quantitative analysis, and benchmarks of diffusion model quantization methods, highlighting current challenges and future research directions.
Findings
Benchmarking of various quantization methods on standard datasets
Analysis of quantization errors and their visual impacts
Identification of key challenges in quantizing diffusion models
Abstract
Recent success of large text-to-image models has empirically underscored the exceptional performance of diffusion models in generative tasks. To facilitate their efficient deployment on resource-constrained edge devices, model quantization has emerged as a pivotal technique for both compression and acceleration. This survey offers a thorough review of the latest advancements in diffusion model quantization, encapsulating and analyzing the current state of the art in this rapidly advancing domain. First, we provide an overview of the key challenges encountered in the quantization of diffusion models, including those based on U-Net architectures and Diffusion Transformers (DiT). We then present a comprehensive taxonomy of prevalent quantization techniques, engaging in an in-depth discussion of their underlying principles. Subsequently, we perform a meticulous analysis of representative…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Face recognition and analysis · Computer Graphics and Visualization Techniques
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Convolution · Max Pooling · Diffusion · Concatenated Skip Connection · U-Net
