TextOVSR: Text-Guided Real-World Opera Video Super-Resolution
Hua Chang, Xin Xu, Wei Liu, Jiayi Wu, Kui Jiang, Fei Ma, Qi Tian

TL;DR
This paper introduces TextOVSR, a novel text-guided approach for real-world opera video super-resolution that effectively models degradations and enhances texture reconstruction using semantic guidance.
Contribution
The paper proposes a dual-branch network with textual prompts and a new feature fusion module to improve super-resolution of degraded opera videos.
Findings
Outperforms state-of-the-art methods on OperaLQ benchmark
Effectively models complex real-world degradations
Enhances texture details with semantic guidance
Abstract
Many classic opera videos exhibit poor visual quality due to the limitations of early filming equipment and long-term degradation during storage. Although real-world video super-resolution (RWVSR) has achieved significant advances in recent years, directly applying existing methods to degraded opera videos remains challenging. The difficulties are twofold. First, accurately modeling real-world degradations is complex: simplistic combinations of classical degradation kernels fail to capture the authentic noise distribution, while methods that extract real noise patches from external datasets are prone to style mismatches that introduce visual artifacts. Second, current RWVSR methods, which rely solely on degraded image features, struggle to reconstruct realistic and detailed textures due to a lack of high-level semantic guidance. To address these issues, we propose a Text-guided…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image Processing Techniques · Image and Video Quality Assessment · Generative Adversarial Networks and Image Synthesis
