TL;DR
This paper introduces channel-wise autoregressive entropy models with enhancements that improve learned image compression efficiency, achieving significant rate savings and outperforming traditional codecs like BPG.
Contribution
The paper proposes novel channel-conditioning and latent residual prediction techniques that enhance rate-distortion performance while reducing serial processing in learned image codecs.
Findings
Average rate savings of 6.7% on Kodak and 11.4% on Tecnick datasets.
Up to 18% rate savings at low bit rates.
Outperforms hand-engineered codecs like BPG by up to 25%.
Abstract
In learning-based approaches to image compression, codecs are developed by optimizing a computational model to minimize a rate-distortion objective. Currently, the most effective learned image codecs take the form of an entropy-constrained autoencoder with an entropy model that uses both forward and backward adaptation. Forward adaptation makes use of side information and can be efficiently integrated into a deep neural network. In contrast, backward adaptation typically makes predictions based on the causal context of each symbol, which requires serial processing that prevents efficient GPU / TPU utilization. We introduce two enhancements, channel-conditioning and latent residual prediction, that lead to network architectures with better rate-distortion performance than existing context-adaptive models while minimizing serial processing. Empirically, we see an average rate savings of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSolana Customer Service Number +1-833-534-1729
