Diffusion Product Quantization

Jie Shao; Hanxiao Zhang; Jianxin Wu

arXiv:2411.12306·cs.CV·November 20, 2024

Diffusion Product Quantization

Jie Shao, Hanxiao Zhang, Jianxin Wu

PDF

Open Access

TL;DR

This paper introduces a product quantization method for diffusion models that significantly reduces model size while maintaining performance, enabling extreme compression down to 1 bit.

Contribution

The authors develop a novel product quantization approach with importance-based codebook compression and end-to-end calibration for diffusion models.

Findings

01

Achieves over 24x model size reduction to 1 bit.

02

Outperforms existing quantization methods on ImageNet.

03

Maintains competitive generative quality after compression.

Abstract

In this work, we explore the quantization of diffusion models in extreme compression regimes to reduce model size while maintaining performance. We begin by investigating classical vector quantization but find that diffusion models are particularly susceptible to quantization error, with the codebook size limiting generation quality. To address this, we introduce product quantization, which offers improved reconstruction precision and larger capacity -- crucial for preserving the generative capabilities of diffusion models. Furthermore, we propose a method to compress the codebook by evaluating the importance of each vector and removing redundancy, ensuring the model size remaining within the desired range. We also introduce an end-to-end calibration approach that adjusts assignments during the forward pass and optimizes the codebook using the DDPM loss. By compressing the model to as…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Data Compression Techniques · Medical Imaging Techniques and Applications

MethodsDiffusion