TL;DR
SAMIC introduces a lightweight, semantic-aware perceptual image compression method leveraging state space models, improving rate-distortion-perception tradeoff and reducing model complexity with innovative modules.
Contribution
It develops a semantic-aware Mamba block and an SVD-inspired redundancy reduction module, enhancing semantic continuity and efficiency in perceptual image compression.
Findings
Outperforms state-of-the-art methods in rate-distortion-perception tradeoff.
Reduces model complexity compared to existing approaches.
Demonstrates effective long-range modeling with linear computational complexity.
Abstract
Perceptual image compression focuses on preserving high visual quality under low-bitrate constraints. Most existing approaches to perceptual compression leverage the strong generative capabilities of generative adversarial networks or diffusion models, at the cost of substantial model complexity. To this end, we present an efficient perceptual image compression method that exploits the long-range modeling capability and linear computational complexity of state space models, with a particular focus on Mamba. Unlike existing methods that rely on an inherently fixed scanning order and consequently impair semantic continuity and spatial correlation, we develop a semantic-aware Mamba block (SAMB) to enable scanning guided by dynamically clustered semantic features, thereby alleviating the strict causality constraints and long-range information decay inherent to Mamba. Inspired by singular…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
