Estimating the Rate-Distortion Function by Wasserstein Gradient Descent
Yibo Yang, Stephan Eckstein, Marcel Nutz, Stephan Mandt

TL;DR
This paper introduces a Wasserstein gradient descent method to estimate the rate-distortion function in lossy compression, learning the optimal reproduction distribution support and outperforming neural network methods in efficiency and accuracy.
Contribution
The paper presents a novel Wasserstein gradient descent algorithm for estimating the R-D function, with proven convergence and reduced computational complexity compared to existing methods.
Findings
Achieves comparable or tighter R-D bounds than neural network methods.
Requires less tuning and computational effort.
Provides a new test case with known R-D solutions.
Abstract
In the theory of lossy compression, the rate-distortion (R-D) function describes how much a data source can be compressed (in bit-rate) at any given level of fidelity (distortion). Obtaining for a given data source establishes the fundamental performance limit for all compression algorithms. We propose a new method to estimate from the perspective of optimal transport. Unlike the classic Blahut--Arimoto algorithm which fixes the support of the reproduction distribution in advance, our Wasserstein gradient descent algorithm learns the support of the optimal reproduction distribution by moving particles. We prove its local convergence and analyze the sample complexity of our R-D estimator based on a connection to entropic optimal transport. Experimentally, we obtain comparable or tighter bounds than state-of-the-art neural network methods on low-rate sources while…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Neural Network Applications · Adversarial Robustness in Machine Learning
