An Improved Upper Bound on the Rate-Distortion Function of Images

Zhihao Duan; Jack Ma; Jiangpeng He; Fengqing Zhu

arXiv:2309.02574·eess.IV·September 7, 2023·ICIP

An Improved Upper Bound on the Rate-Distortion Function of Images

Zhihao Duan, Jack Ma, Jiangpeng He, Fengqing Zhu

PDF

Open Access 1 Repo

TL;DR

This paper presents an improved upper bound on the image rate-distortion function using a new VAE architecture, variable-rate techniques, and a stabilization method, demonstrating significant potential for enhancing lossy image compression.

Contribution

Introduces a novel VAE model and training stabilization method to better estimate the image rate-distortion function, enabling more effective lossy compression.

Findings

01

Achieves at least 30% BD-rate reduction compared to VVC intra prediction.

02

Demonstrates the effectiveness of the new VAE architecture and training method.

03

Provides publicly available code for reproducibility.

Abstract

Recent work has shown that Variational Autoencoders (VAEs) can be used to upper-bound the information rate-distortion (R-D) function of images, i.e., the fundamental limit of lossy image compression. In this paper, we report an improved upper bound on the R-D function of images implemented by (1) introducing a new VAE model architecture, (2) applying variable-rate compression techniques, and (3) proposing a novel \ourfunction{} to stabilize training. We demonstrate that at least 30\% BD-rate reduction w.r.t. the intra prediction mode in VVC codec is achievable, suggesting that there is still great potential for improving lossy image compression. Code is made publicly available at https://github.com/duanzhiihao/lossy-vae.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

duanzhiihao/lossy-vae
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDigital Media Forensic Detection · Generative Adversarial Networks and Image Synthesis · Advanced Image Processing Techniques