Timestep-Aware Correction for Quantized Diffusion Models
Yuzhe Yao, Feng Tian, Jun Chen, Haonan Lin, Guang Dai, Yong Liu,, Jingdong Wang

TL;DR
This paper introduces a timestep-aware correction method for quantized diffusion models that dynamically reduces quantization errors during image generation, significantly improving output quality on resource-limited devices.
Contribution
The paper presents a novel timestep-aware correction technique that addresses error accumulation in low-precision diffusion models, enhancing their performance with minimal additional computation.
Findings
Achieves state-of-the-art results on low-precision diffusion models.
Substantially improves image quality with negligible computational overhead.
Demonstrates generalizability across different diffusion model architectures.
Abstract
Diffusion models have marked a significant breakthrough in the synthesis of semantically coherent images. However, their extensive noise estimation networks and the iterative generation process limit their wider application, particularly on resource-constrained platforms like mobile devices. Existing post-training quantization (PTQ) methods have managed to compress diffusion models to low precision. Nevertheless, due to the iterative nature of diffusion models, quantization errors tend to accumulate throughout the generation process. This accumulation of error becomes particularly problematic in low-precision scenarios, leading to significant distortions in the generated images. We attribute this accumulation issue to two main causes: error propagation and exposure bias. To address these problems, we propose a timestep-aware correction method for quantized diffusion model, which…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDistributed and Parallel Computing Systems
MethodsDiffusion
