A Single Video Super-Resolution GAN for Multiple Downsampling Operators based on Pseudo-Inverse Image Formation Models
Santiago L\'opez-Tapia, Alice Lucas, Rafael Molina, Aggelos, K. Katsaggelos

TL;DR
This paper introduces a robust video super-resolution neural network that handles multiple degradation models using pseudo-inverse image formation and perceptual losses, outperforming existing methods.
Contribution
A novel CNN architecture incorporating pseudo-inverse image formation models and perceptual losses for multi-degradation robustness in video super-resolution.
Findings
Outperforms state-of-the-art methods
Robust to multiple degradation models
Effective in real-world scenarios
Abstract
The popularity of high and ultra-high definition displays has led to the need for methods to improve the quality of videos already obtained at much lower resolutions. Current Video Super-Resolution methods are not robust to mismatch between training and testing degradation models since they are trained against a single degradation model (usually bicubic downsampling). This causes their performance to deteriorate in real-life applications. At the same time, the use of only the Mean Squared Error during learning causes the resulting images to be too smooth. In this work we propose a new Convolutional Neural Network for video super resolution which is robust to multiple degradation models. During training, which is performed on a large dataset of scenes with slow and fast motions, it uses the pseudo-inverse image formation model as part of the network architecture in conjunction with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
