Real-World Super-Resolution of Face-Images from Surveillance Cameras
Andreas Aakerberg, Kamal Nasrollahi, Thomas B. Moeslund

TL;DR
This paper introduces a realistic training data generation framework for face image super-resolution from surveillance footage, improving reconstruction quality by modeling real-world image artifacts and using GANs with perceptual loss.
Contribution
It presents a novel method to generate realistic LR/HR training pairs, enabling better super-resolution models for real surveillance images, and demonstrates improved results over existing methods.
Findings
More detailed face reconstructions with less noise.
Traditional IQA metrics fail to capture perceptual improvements.
NIMA correlates better with human perception for quality assessment.
Abstract
Most existing face image Super-Resolution (SR) methods assume that the Low-Resolution (LR) images were artificially downsampled from High-Resolution (HR) images with bicubic interpolation. This operation changes the natural image characteristics and reduces noise. Hence, SR methods trained on such data most often fail to produce good results when applied to real LR images. To solve this problem, we propose a novel framework for generation of realistic LR/HR training pairs. Our framework estimates realistic blur kernels, noise distributions, and JPEG compression artifacts to generate LR images with similar image characteristics as the ones in the source domain. This allows us to train a SR model using high quality face images as Ground-Truth (GT). For better perceptual quality we use a Generative Adversarial Network (GAN) based SR model where we have exchanged the commonly used VGG-loss…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsNeural Image Assessment
