Taming Real-World Space-Time Video Super-Resolution with One-Step Diffusion

Shuoyan Wei; Feng Li; Chen Zhou; Runmin Cong; Yao Zhao; Huihui Bai

arXiv:2601.20308·cs.CV·May 20, 2026

Taming Real-World Space-Time Video Super-Resolution with One-Step Diffusion

Shuoyan Wei, Feng Li, Chen Zhou, Runmin Cong, Yao Zhao, Huihui Bai

PDF

1 Repo 1 Models

TL;DR

This paper introduces OSDEnhancer, a novel one-step diffusion framework for space-time video super-resolution that effectively handles real-world complex degradations and enhances both spatial and temporal details.

Contribution

The paper presents the first one-step diffusion-based STVSR method with a divide-and-conquer strategy and specialized LoRAs for improved real-world performance.

Findings

01

Achieves state-of-the-art results in real-world scenarios.

02

Demonstrates superior generalization over existing methods.

03

Effectively recovers fine textures and maintains temporal coherence.

Abstract

Diffusion models have demonstrated exceptional success in video super-resolution (VSR), exhibiting powerful capabilities for generating fine-grained details. However, their potential for space-time video super-resolution (STVSR), which necessitates not only recovering realistic high-resolution visual content but also improving the frame rate with coherent temporal dynamics, remains largely underexplored. Moreover, existing STVSR methods predominantly address spatiotemporal upsampling under simple degradation assumptions, thus failing in real-world scenarios with complex unknown degradations. To address these challenges, we propose OSDEnhancer, the first framework that achieves robust STVSR in one-step diffusion. OSDEnhancer begins with a linear initialization to establish essential spatiotemporal structures and adapt the model for one-step reconstruction. It then applies a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

W-Shuoyan/OSDEnhancer
github

Models

🤗
W-Shuoyan/OSDEnhancer
model

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image Processing Techniques · Image and Video Quality Assessment · Advanced Vision and Imaging