Enhanced Residual SwinV2 Transformer for Learned Image Compression
Yongqiang Wang, Feng Liang, Haisheng Fu, Jie Liang, Haipeng Qin,, Junzhe Liang

TL;DR
This paper introduces an enhanced residual SwinV2 transformer framework for learned image compression that maintains high performance while significantly reducing model complexity, making practical deployment more feasible.
Contribution
The paper proposes a novel image compression framework using an enhanced residual SwinV2 transformer with a feature enhancement module, achieving high performance with 56% less model complexity.
Findings
Achieves comparable performance to recent methods on Kodak and Tecnick datasets.
Outperforms traditional codecs like VVC in rate-distortion performance.
Reduces model complexity by 56% compared to recent learned methods.
Abstract
Recently, the deep learning technology has been successfully applied in the field of image compression, leading to superior rate-distortion performance. However, a challenge of many learning-based approaches is that they often achieve better performance via sacrificing complexity, which making practical deployment difficult. To alleviate this issue, in this paper, we propose an effective and efficient learned image compression framework based on an enhanced residual Swinv2 transformer. To enhance the nonlinear representation of images in our framework, we use a feature enhancement module that consists of three consecutive convolutional layers. In the subsequent coding and hyper coding steps, we utilize a SwinV2 transformer-based attention mechanism to process the input image. The SwinV2 model can help to reduce model complexity while maintaining high performance. Experimental results…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Data Compression Techniques · Advanced Image Processing Techniques · Image Retrieval and Classification Techniques
