ConvNeXt-ChARM: ConvNeXt-based Transform for Efficient Neural Image Compression
Ahmed Ghorbel, Wassim Hamidouche, Luce Morin

TL;DR
ConvNeXt-ChARM introduces an efficient neural image compression framework leveraging ConvNeXt architecture and a channel-wise auto-regressive prior, achieving significant rate-distortion improvements over existing methods while maintaining computational efficiency.
Contribution
The paper presents a novel ConvNeXt-based transform coding framework with a compute-efficient prior, outperforming state-of-the-art methods in neural image compression.
Findings
Achieves 5.24% BD-rate reduction over VVC encoder
Achieves 1.22% BD-rate reduction over SwinT-ChARM
Demonstrates computational efficiency and superior image quality
Abstract
Over the last few years, neural image compression has gained wide attention from research and industry, yielding promising end-to-end deep neural codecs outperforming their conventional counterparts in rate-distortion performance. Despite significant advancement, current methods, including attention-based transform coding, still need to be improved in reducing the coding rate while preserving the reconstruction fidelity, especially in non-homogeneous textured image areas. Those models also require more parameters and a higher decoding time. To tackle the above challenges, we propose ConvNeXt-ChARM, an efficient ConvNeXt-based transform coding framework, paired with a compute-efficient channel-wise auto-regressive prior to capturing both global and local contexts from the hyper and quantized latent representations. The proposed architecture can be optimized end-to-end to fully exploit…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image Processing Techniques · Image and Signal Denoising Methods · Generative Adversarial Networks and Image Synthesis
MethodsMulti-Head Attention · Attention Is All You Need · Absolute Position Encodings · Label Smoothing · Linear Layer · Residual Connection · Adam · Dropout · Layer Normalization · Stochastic Depth
