TL;DR
This paper introduces an interpretable framework for space-time video super-resolution that jointly addresses deblurring, interpolation, and super-resolution by combining model-based and learning-based methods, improving video quality.
Contribution
It formulates STVSR as two interleaved sub-problems solved alternately, with an analytical Fourier transform layer and a recurrent enhancement module, offering interpretability and superior performance.
Findings
Outperforms existing methods in quantitative metrics
Produces higher visual quality in reconstructed videos
Effectively handles motion blur and aliasing
Abstract
In this paper, we study a practical space-time video super-resolution (STVSR) problem which aims at generating a high-framerate high-resolution sharp video from a low-framerate low-resolution blurry video. Such problem often occurs when recording a fast dynamic event with a low-framerate and low-resolution camera, and the captured video would suffer from three typical issues: i) motion blur occurs due to object/camera motions during exposure time; ii) motion aliasing is unavoidable when the event temporal frequency exceeds the Nyquist limit of temporal sampling; iii) high-frequency details are lost because of the low spatial sampling rate. These issues can be alleviated by a cascade of three separate sub-tasks, including video deblurring, frame interpolation, and super-resolution, which, however, would fail to capture the spatial and temporal correlations among video sequences. To…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
