Enhancing Learned Image Compression via Cross Window-based Attention

Priyanka Mudgal; Feng Liu

arXiv:2410.21144·cs.CV·February 14, 2025

Enhancing Learned Image Compression via Cross Window-based Attention

Priyanka Mudgal, Feng Liu

PDF

Open Access 1 Repo

TL;DR

This paper introduces a CNN-based image compression method that incorporates a feature encoding module and cross-scale window-based attention to better capture local and global redundancies, achieving competitive results.

Contribution

The paper proposes a novel CNN architecture with a feature encoding and cross-scale window-based attention modules that can be integrated into existing networks for improved image compression.

Findings

01

Effective in capturing local redundancy

02

Achieves performance comparable to state-of-the-art methods

03

Flexible modules that can be incorporated into other architectures

Abstract

In recent years, learned image compression methods have demonstrated superior rate-distortion performance compared to traditional image compression methods. Recent methods utilize convolutional neural networks (CNN), variational autoencoders (VAE), invertible neural networks (INN), and transformers. Despite their significant contributions, a main drawback of these models is their poor performance in capturing local redundancy. Therefore, to leverage global features along with local redundancy, we propose a CNN-based solution integrated with a feature encoding module. The feature encoding module encodes important features before feeding them to the CNN and then utilizes cross-scale window-based attention, which further captures local redundancy. Cross-scale window-based attention is inspired by the attention mechanism in transformers and effectively enlarges the receptive field. Both the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

prmudgal/CWAM_IC_ISVC
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Data Compression Techniques · Image Retrieval and Classification Techniques · Advanced Image and Video Retrieval Techniques

MethodsSoftmax · Attention Is All You Need