EDA-DM: Enhanced Distribution Alignment for Post-Training Quantization of Diffusion Models
Xuewen Liu, Zhikai Li, Junrui Xiao, Mengjuan Chen, Jianquan Li, and Qingyi Gu

TL;DR
This paper introduces EDA-DM, a post-training quantization method for diffusion models that effectively reduces model size and speeds up inference with minimal performance loss by addressing distribution mismatch issues.
Contribution
The paper proposes EDA-DM, a novel PTQ approach that aligns activation distributions at multiple levels, improving quantization of diffusion models without fine-tuning.
Findings
Achieves 1.83x speedup and 4x compression on Stable-Diffusion
Maintains high image generation quality with only 0.05 loss in CLIP score
Outperforms existing PTQ methods across various models and datasets
Abstract
Diffusion models have achieved great success in image generation tasks. However, the lengthy denoising process and complex neural networks hinder their low-latency applications in real-world scenarios. Quantization can effectively reduce model complexity, and post-training quantization (PTQ), which does not require fine-tuning, is highly promising for compressing and accelerating diffusion models. Unfortunately, we find that due to the highly dynamic activations, existing PTQ methods suffer from distribution mismatch issues at both calibration sample level and reconstruction output level, which makes the performance far from satisfactory. In this paper, we propose EDA-DM, a standardized PTQ method that efficiently addresses the above issues. Specifically, at the calibration sample level, we extract information from the density and diversity of latent space feature maps, which guides the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Neuroimaging Techniques and Applications · Radiomics and Machine Learning in Medical Imaging
MethodsDiffusion · ALIGN
