TL;DR
This paper introduces OSDEnhancer, a novel one-step diffusion framework for space-time video super-resolution that effectively handles real-world complex degradations and enhances both spatial and temporal details.
Contribution
The paper presents the first one-step diffusion-based STVSR method with a divide-and-conquer strategy and specialized LoRAs for improved real-world performance.
Findings
Achieves state-of-the-art results in real-world scenarios.
Demonstrates superior generalization over existing methods.
Effectively recovers fine textures and maintains temporal coherence.
Abstract
Diffusion models have demonstrated exceptional success in video super-resolution (VSR), exhibiting powerful capabilities for generating fine-grained details. However, their potential for space-time video super-resolution (STVSR), which necessitates not only recovering realistic high-resolution visual content but also improving the frame rate with coherent temporal dynamics, remains largely underexplored. Moreover, existing STVSR methods predominantly address spatiotemporal upsampling under simple degradation assumptions, thus failing in real-world scenarios with complex unknown degradations. To address these challenges, we propose OSDEnhancer, the first framework that achieves robust STVSR in one-step diffusion. OSDEnhancer begins with a linear initialization to establish essential spatiotemporal structures and adapt the model for one-step reconstruction. It then applies a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image Processing Techniques · Image and Video Quality Assessment · Advanced Vision and Imaging
