Frame and Feature-Context Video Super-Resolution
Bo Yan, Chuming Lin, Weimin Tan

TL;DR
This paper introduces FFCVSR, an end-to-end trainable video super-resolution network that effectively combines local and contextual information from multiple frames to produce high-quality, temporally consistent results in real-time.
Contribution
The paper proposes a novel FFCVSR network with local and context sub-networks, improving temporal consistency and visual quality over existing methods.
Findings
Achieves state-of-the-art accuracy on benchmark datasets.
Maintains real-time processing speed.
Produces visually superior, temporally consistent videos.
Abstract
For video super-resolution, current state-of-the-art approaches either process multiple low-resolution (LR) frames to produce each output high-resolution (HR) frame separately in a sliding window fashion or recurrently exploit the previously estimated HR frames to super-resolve the following frame. The main weaknesses of these approaches are: 1) separately generating each output frame may obtain high-quality HR estimates while resulting in unsatisfactory flickering artifacts, and 2) combining previously generated HR frames can produce temporally consistent results in the case of short information flow, but it will cause significant jitter and jagged artifacts because the previous super-resolving errors are constantly accumulated to the subsequent frames. In this paper, we propose a fully end-to-end trainable frame and feature-context video super-resolution (FFCVSR) network that consists…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image Processing Techniques · Advanced Vision and Imaging · Image and Signal Denoising Methods
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
