Geometric Transformation-Embedded Mamba for Learned Video Compression
Hao Wei, Yanhui Zhou, Chenyang Ge

TL;DR
This paper introduces a novel learned video compression framework that uses geometric transformations and a direct nonlinear transform strategy, outperforming existing methods in perceptual quality and temporal consistency at low bitrates.
Contribution
The work proposes a cascaded Mamba module with embedded geometric transformations and a locality refinement network, along with a conditional entropy model, to improve learned video compression.
Findings
Outperforms state-of-the-art methods in perceptual quality
Achieves better temporal consistency at low bitrates
Demonstrates effectiveness of geometric transformation embedding
Abstract
Although learned video compression methods have exhibited outstanding performance, most of them typically follow a hybrid coding paradigm that requires explicit motion estimation and compensation, resulting in a complex solution for video compression. In contrast, we introduce a streamlined yet effective video compression framework founded on a direct transform strategy, i.e., nonlinear transform, quantization, and entropy coding. We first develop a cascaded Mamba module (CMM) with different embedded geometric transformations to effectively explore both long-range spatial and temporal dependencies. To improve local spatial representation, we introduce a locality refinement feed-forward network (LRFFN) that incorporates a hybrid convolution block based on difference convolutions. We integrate the proposed CMM and LRFFN into the encoder and decoder of our compression framework. Moreover,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Data Compression Techniques · Video Coding and Compression Technologies · Advanced Image Processing Techniques
