MambaVSR: Content-Aware Scanning State Space Model for Video Super-Resolution

Linfeng He; Meiqin Liu; Qi Tang; Chao Yao; Yao Zhao

arXiv:2506.11768·cs.CV·June 16, 2025

MambaVSR: Content-Aware Scanning State Space Model for Video Super-Resolution

Linfeng He, Meiqin Liu, Qi Tang, Chao Yao, Yao Zhao

PDF

Open Access

TL;DR

MambaVSR introduces a novel content-aware state-space model for video super-resolution that effectively models non-local dependencies and improves detail recovery with fewer parameters.

Contribution

It is the first to incorporate a content-aware scanning mechanism within a state-space model framework for VSR, enhancing non-local dependency modeling and efficiency.

Findings

01

Outperforms Transformer-based methods by 0.58 dB PSNR on REDS dataset.

02

Uses 55% fewer parameters than existing methods.

03

Effectively models non-local dependencies and preserves details.

Abstract

Video super-resolution (VSR) faces critical challenges in effectively modeling non-local dependencies across misaligned frames while preserving computational efficiency. Existing VSR methods typically rely on optical flow strategies or transformer architectures, which struggle with large motion displacements and long video sequences. To address this, we propose MambaVSR, the first state-space model framework for VSR that incorporates an innovative content-aware scanning mechanism. Unlike rigid 1D sequential processing in conventional vision Mamba methods, our MambaVSR enables dynamic spatiotemporal interactions through the Shared Compass Construction (SCC) and the Content-Aware Sequentialization (CAS). Specifically, the SCC module constructs intra-frame semantic connectivity graphs via efficient sparse attention and generates adaptive spatial scanning sequences through spectral…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image Processing Techniques · Image and Signal Denoising Methods · Advanced Vision and Imaging