ConvNeXt-ChARM: ConvNeXt-based Transform for Efficient Neural Image   Compression

Ahmed Ghorbel; Wassim Hamidouche; Luce Morin

arXiv:2307.06342·cs.CV·July 14, 2023

ConvNeXt-ChARM: ConvNeXt-based Transform for Efficient Neural Image Compression

Ahmed Ghorbel, Wassim Hamidouche, Luce Morin

PDF

Open Access

TL;DR

ConvNeXt-ChARM introduces an efficient neural image compression framework leveraging ConvNeXt architecture and a channel-wise auto-regressive prior, achieving significant rate-distortion improvements over existing methods while maintaining computational efficiency.

Contribution

The paper presents a novel ConvNeXt-based transform coding framework with a compute-efficient prior, outperforming state-of-the-art methods in neural image compression.

Findings

01

Achieves 5.24% BD-rate reduction over VVC encoder

02

Achieves 1.22% BD-rate reduction over SwinT-ChARM

03

Demonstrates computational efficiency and superior image quality

Abstract

Over the last few years, neural image compression has gained wide attention from research and industry, yielding promising end-to-end deep neural codecs outperforming their conventional counterparts in rate-distortion performance. Despite significant advancement, current methods, including attention-based transform coding, still need to be improved in reducing the coding rate while preserving the reconstruction fidelity, especially in non-homogeneous textured image areas. Those models also require more parameters and a higher decoding time. To tackle the above challenges, we propose ConvNeXt-ChARM, an efficient ConvNeXt-based transform coding framework, paired with a compute-efficient channel-wise auto-regressive prior to capturing both global and local contexts from the hyper and quantized latent representations. The proposed architecture can be optimized end-to-end to fully exploit…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image Processing Techniques · Image and Signal Denoising Methods · Generative Adversarial Networks and Image Synthesis

MethodsMulti-Head Attention · Attention Is All You Need · Absolute Position Encodings · Label Smoothing · Linear Layer · Residual Connection · Adam · Dropout · Layer Normalization · Stochastic Depth