MambaOVSR: Multiscale Fusion with Global Motion Modeling for Chinese Opera Video Super-Resolution
Hua Chang, Xin Xu, Wei Liu, Wei Wang, Xin Yuan, Kui Jiang

TL;DR
This paper introduces MambaOVSR, a novel multiscale fusion network with global motion modeling for Chinese opera video super-resolution, addressing dataset scarcity and large motion challenges to improve video quality.
Contribution
It presents a new large-scale Chinese Opera Video Clip dataset and a multiscale fusion network with three novel modules for enhanced space-time super-resolution.
Findings
Outperforms state-of-the-art methods by 1.86 dB PSNR on COVC dataset.
Introduces effective global motion modeling for opera videos.
Provides publicly available dataset and code.
Abstract
Chinese opera is celebrated for preserving classical art. However, early filming equipment limitations have degraded videos of last-century performances by renowned artists (e.g., low frame rates and resolution), hindering archival efforts. Although space-time video super-resolution (STVSR) has advanced significantly, applying it directly to opera videos remains challenging. The scarcity of datasets impedes the recovery of high frequency details, and existing STVSR methods lack global modeling capabilities, compromising visual quality when handling opera's characteristic large motions. To address these challenges, we pioneer a large scale Chinese Opera Video Clip (COVC) dataset and propose the Mamba-based multiscale fusion network for space-time Opera Video Super-Resolution (MambaOVSR). Specifically, MambaOVSR involves three novel components: the Global Fusion Module (GFM) for motion…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image Processing Techniques · Advanced Vision and Imaging · Generative Adversarial Networks and Image Synthesis
