Post-training Quantization for Text-to-Image Diffusion Models with Progressive Calibration and Activation Relaxing
Siao Tang, Xin Wang, Hong Chen, Chaoyu Guan, Zewen Wu, Yansong Tang,, Wenwu Zhu

TL;DR
This paper introduces PCR, a post-training quantization method with progressive calibration and activation relaxing, to efficiently compress text-to-image diffusion models like Stable Diffusion, with improved accuracy and a new benchmark for evaluation.
Contribution
The paper proposes a novel PTQ method for text-to-image diffusion models, including a new benchmark, and achieves the first quantization of Stable Diffusion XL with maintained performance.
Findings
PCR outperforms existing quantization methods on Stable Diffusion.
The new QDiffBench benchmark provides more accurate evaluation.
Stable Diffusion XL is quantized successfully with minimal performance loss.
Abstract
High computational overhead is a troublesome problem for diffusion models. Recent studies have leveraged post-training quantization (PTQ) to compress diffusion models. However, most of them only focus on unconditional models, leaving the quantization of widely-used pretrained text-to-image models, e.g., Stable Diffusion, largely unexplored. In this paper, we propose a novel post-training quantization method PCR (Progressive Calibration and Relaxing) for text-to-image diffusion models, which consists of a progressive calibration strategy that considers the accumulated quantization error across timesteps, and an activation relaxing strategy that improves the performance with negligible cost. Additionally, we demonstrate the previous metrics for text-to-image diffusion model quantization are not accurate due to the distribution gap. To tackle the problem, we propose a novel QDiffBench…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neuroimaging Techniques and Applications · Generative Adversarial Networks and Image Synthesis
MethodsFocus · Diffusion
