AICT: An Adaptive Image Compression Transformer

Ahmed Ghorbel; Wassim Hamidouche; Luce Morin

arXiv:2307.06091·cs.CV·July 13, 2023

AICT: An Adaptive Image Compression Transformer

Ahmed Ghorbel, Wassim Hamidouche, Luce Morin

PDF

Open Access

TL;DR

AICT introduces an adaptive image compression transformer that captures global and local contexts more effectively, leading to improved compression efficiency and image quality compared to existing methods.

Contribution

It proposes a novel Transformer-based auto-regressive prior model and a learnable scaling module with ConvNeXt components for better latent representation and image reconstruction.

Findings

01

Outperforms VVC reference encoder in efficiency

02

Achieves higher quality image reconstruction

03

Enhances global and local context modeling

Abstract

Motivated by the efficiency investigation of the Tranformer-based transform coding framework, namely SwinT-ChARM, we propose to enhance the latter, as first, with a more straightforward yet effective Tranformer-based channel-wise auto-regressive prior model, resulting in an absolute image compression transformer (ICT). Current methods that still rely on ConvNet-based entropy coding are limited in long-range modeling dependencies due to their local connectivity and an increasing number of architectural biases and priors. On the contrary, the proposed ICT can capture both global and local contexts from the latent representations and better parameterize the distribution of the quantized latents. Further, we leverage a learnable scaling module with a sandwich ConvNeXt-based pre/post-processor to accurately extract more compact latent representation while reconstructing higher-quality…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Data Compression Techniques · Advanced Image Processing Techniques · Image and Signal Denoising Methods