Diversify, Contextualize, and Adapt: Efficient Entropy Modeling for Neural Image Codec
Jun-Hyuk Kim, Seungeon Kim, Won-Hee Lee, Dokwan Oh

TL;DR
This paper introduces a novel entropy modeling framework for neural image codecs that diversifies hyper latent representations to improve rate-distortion performance without increasing bit-rate, demonstrating significant gains over existing methods.
Contribution
It proposes a simple yet effective entropy modeling framework that diversifies hyper latent representations for better contextual information in neural image codecs.
Findings
Achieves 3.73% BD-rate gain on Kodak dataset.
Consistently improves rate-distortion performance across various bit-rates.
Enhances forward adaptation by diversifying hyper latent contexts.
Abstract
Designing a fast and effective entropy model is challenging but essential for practical application of neural codecs. Beyond spatial autoregressive entropy models, more efficient backward adaptation-based entropy models have been recently developed. They not only reduce decoding time by using smaller number of modeling steps but also maintain or even improve rate--distortion performance by leveraging more diverse contexts for backward adaptation. Despite their significant progress, we argue that their performance has been limited by the simple adoption of the design convention for forward adaptation: using only a single type of hyper latent representation, which does not provide sufficient contextual information, especially in the first modeling step. In this paper, we propose a simple yet effective entropy modeling framework that leverages sufficient contexts for forward adaptation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsNeural Networks and Applications · Image and Signal Denoising Methods · Cell Image Analysis Techniques
