Joint Hierarchical Priors and Adaptive Spatial Resolution for Efficient Neural Image Compression
Ahmed Ghorbel, Wassim Hamidouche, Luce Morin

TL;DR
This paper introduces an efficient neural image compression framework using a Transformer-based auto-regressive prior and adaptive spatial resolution, achieving better compression quality with lower complexity.
Contribution
It proposes a novel Transformer-based channel-wise auto-regressive prior and a learnable scaling module with ConvNeXt components for improved neural image compression.
Findings
Significantly outperforms VVC and SwinT-ChARM in efficiency and quality.
Reduces decoding latency while maintaining high compression performance.
Demonstrates computational efficiency through model scaling studies.
Abstract
Recently, the performance of neural image compression (NIC) has steadily improved thanks to the last line of study, reaching or outperforming state-of-the-art conventional codecs. Despite significant progress, current NIC methods still rely on ConvNet-based entropy coding, limited in modeling long-range dependencies due to their local connectivity and the increasing number of architectural biases and priors, resulting in complex underperforming models with high decoding latency. Motivated by the efficiency investigation of the Tranformer-based transform coding framework, namely SwinT-ChARM, we propose to enhance the latter, as first, with a more straightforward yet effective Tranformer-based channel-wise auto-regressive prior model, resulting in an absolute image compression transformer (ICT). Through the proposed ICT, we can capture both global and local contexts from the latent…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMedical Image Segmentation Techniques · Generative Adversarial Networks and Image Synthesis · Image and Signal Denoising Methods
