Differentiable Vector Quantization for Rate-Distortion Optimization of Generative Image Compression

Shiyin Jiang; Wei Long; Minghao Han; Zhenghao Chen; Ce Zhu; Shuhang Gu

arXiv:2604.10546·cs.CV·May 5, 2026

Differentiable Vector Quantization for Rate-Distortion Optimization of Generative Image Compression

Shiyin Jiang, Wei Long, Minghao Han, Zhenghao Chen, Ce Zhu, Shuhang Gu

PDF

1 Repo

TL;DR

RDVQ introduces a differentiable, end-to-end optimized vector quantization framework for ultra-low bitrate image compression, achieving high perceptual quality with fewer parameters.

Contribution

It presents a novel differentiable relaxation of the codebook distribution enabling joint rate-distortion optimization in VQ-based image compression.

Findings

01

Achieves up to 75.71% bitrate reduction on DISTS metric.

02

Attains competitive or superior perceptual quality at extremely low bitrates.

03

Uses a lightweight architecture with significantly fewer parameters.

Abstract

The rapid growth of visual data under stringent storage and bandwidth constraints makes extremely low-bitrate image compression increasingly important. While Vector Quantization (VQ) offers strong structural fidelity, existing methods lack a principled mechanism for joint rate-distortion (RD) optimization due to the disconnect between representation learning and entropy modeling. We propose RDVQ, a unified framework that enables end-to-end RD optimization for VQ-based compression via a differentiable relaxation of the codebook distribution, allowing the entropy loss to directly shape the latent prior. We further develop an autoregressive entropy model that supports accurate entropy modeling and test-time rate control. Extensive experiments demonstrate that RDVQ achieves strong performance at extremely low bitrates with a lightweight architecture, attaining competitive or superior…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

CVL-UESTC/RDVQ
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.