CMamba: Learned Image Compression with State Space Models
Zhuojie Wu, Heming Du, Shuyun Wang, Ming Lu, Haiyang Sun, Yandong Guo,, Xin Yu

TL;DR
CMamba is a hybrid image compression framework combining CNNs and state space models to enhance rate-distortion performance while maintaining low computational complexity.
Contribution
It introduces a Content-Adaptive SSM module and a Context-Aware Entropy module to effectively fuse global and local features and reduce redundancies in latent representations.
Findings
Achieves superior rate-distortion performance.
Reduces computational complexity compared to existing methods.
Effectively preserves high-frequency details during compression.
Abstract
Learned Image Compression (LIC) has explored various architectures, such as Convolutional Neural Networks (CNNs) and transformers, in modeling image content distributions in order to achieve compression effectiveness. However, achieving high rate-distortion performance while maintaining low computational complexity (\ie, parameters, FLOPs, and latency) remains challenging. In this paper, we propose a hybrid Convolution and State Space Models (SSMs) based image compression framework, termed \textit{CMamba}, to achieve superior rate-distortion performance with low computational complexity. Specifically, CMamba introduces two key components: a Content-Adaptive SSM (CA-SSM) module and a Context-Aware Entropy (CAE) module. First, we observed that SSMs excel in modeling overall content but tend to lose high-frequency details. In contrast, CNNs are proficient at capturing local details.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Data Compression Techniques
MethodsConvolution
