Self-Enhanced Convolutional Network for Facial Video Hallucination
Chaowei Fang, Guanbin Li, Xiaoguang Han, Yizhou Yu

TL;DR
This paper introduces a self-enhanced convolutional network that leverages temporal dependencies in facial videos to improve super-resolution quality, outperforming existing methods in both face-specific and general video super-resolution tasks.
Contribution
The proposed method uniquely exploits preceding frames and temporal coherence to enhance facial video hallucination, addressing alignment and consistency challenges in video super-resolution.
Findings
Outperforms state-of-the-art facial video hallucination methods.
Achieves high-quality results in general video super-resolution.
Demonstrates superior quantitative and qualitative performance.
Abstract
As a domain-specific super-resolution problem, facial image hallucination has enjoyed a series of breakthroughs thanks to the advances of deep convolutional neural networks. However, the direct migration of existing methods to video is still difficult to achieve good performance due to its lack of alignment and consistency modelling in temporal domain. Taking advantage of high inter-frame dependency in videos, we propose a self-enhanced convolutional network for facial video hallucination. It is implemented by making full usage of preceding super-resolved frames and a temporal window of adjacent low-resolution frames. Specifically, the algorithm first obtains the initial high-resolution inference of each frame by taking into consideration a sequence of consecutive low-resolution inputs through temporal consistency modelling. It further recurrently exploits the reconstructed results and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
