Joint Autoregressive and Hierarchical Priors for Learned Image Compression
David Minnen, Johannes Ball\'e, and George Toderici

TL;DR
This paper introduces a combined autoregressive and hierarchical prior model for learned image compression, achieving state-of-the-art rate-distortion performance and outperforming traditional codecs like BPG on multiple metrics.
Contribution
It proposes a novel combined prior approach that leverages autoregressive and hierarchical models, significantly improving compression efficiency over previous learned methods.
Findings
Achieves 15.8% average reduction in file size over previous state-of-the-art.
Outperforms BPG on both PSNR and MS-SSIM metrics.
Provides the first learning-based method surpassing BPG in image compression.
Abstract
Recent models for learned image compression are based on autoencoders, learning approximately invertible mappings from pixels to a quantized latent representation. These are combined with an entropy model, a prior on the latent representation that can be used with standard arithmetic coding algorithms to yield a compressed bitstream. Recently, hierarchical entropy models have been introduced as a way to exploit more structure in the latents than simple fully factorized priors, improving compression performance while maintaining end-to-end optimization. Inspired by the success of autoregressive priors in probabilistic generative models, we examine autoregressive, hierarchical, as well as combined priors as alternatives, weighing their costs and benefits in the context of image compression. While it is well known that autoregressive models come with a significant computational penalty, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Data Compression Techniques · Generative Adversarial Networks and Image Synthesis · Music and Audio Processing
