DOVE: Efficient One-Step Diffusion Model for Real-World Video Super-Resolution
Zheng Chen, Zichen Zou, Kewei Zhang, Xiongfei Su, Xin Yuan, Yong Guo, Yulun Zhang

TL;DR
DOVE introduces an efficient one-step diffusion model for real-world video super-resolution, significantly speeding up inference while maintaining high restoration quality through a novel training strategy and dataset.
Contribution
The paper presents DOVE, a novel one-step diffusion model for VSR, achieved by fine-tuning a pretrained model with a new training strategy and dataset, enabling fast and high-quality video super-resolution.
Findings
DOVE achieves up to 28× faster inference than existing methods.
DOVE maintains comparable or better super-resolution performance.
The latent-pixel training strategy improves model adaptation to VSR tasks.
Abstract
Diffusion models have demonstrated promising performance in real-world video super-resolution (VSR). However, the dozens of sampling steps they require, make inference extremely slow. Sampling acceleration techniques, particularly single-step, provide a potential solution. Nonetheless, achieving one step in VSR remains challenging, due to the high training overhead on video data and stringent fidelity demands. To tackle the above issues, we propose DOVE, an efficient one-step diffusion model for real-world VSR. DOVE is obtained by fine-tuning a pretrained video diffusion model (i.e., CogVideoX). To effectively train DOVE, we introduce the latent-pixel training strategy. The strategy employs a two-stage scheme to gradually adapt the model to the video super-resolution task. Meanwhile, we design a video processing pipeline to construct a high-quality dataset tailored for VSR, termed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdvanced Image Processing Techniques · Image and Signal Denoising Methods · Image and Video Quality Assessment
MethodsDiffusion
