Efficient Video Face Enhancement with Enhanced Spatial-Temporal Consistency
Yutong Wang, Jiajie Teng, Jiajiong Cao, Yuming Li, Chenguang Ma,, Hongteng Xu, Dixin Luo

TL;DR
This paper introduces an efficient blind video face enhancement method that improves quality and temporal consistency of compressed face videos using a novel 3D-VQGAN-based framework with a de-flickering mechanism.
Contribution
The paper proposes a new 3D-VQGAN-based framework with a two-stage learning process and de-flickering for improved face video enhancement, surpassing current methods in efficiency and quality.
Findings
Outperforms state-of-the-art methods on VFHQ-Test dataset
Reduces flickering and improves visual quality of face videos
Achieves faster processing times compared to existing approaches
Abstract
As a very common type of video, face videos often appear in movies, talk shows, live broadcasts, and other scenes. Real-world online videos are often plagued by degradations such as blurring and quantization noise, due to the high compression ratio caused by high communication costs and limited transmission bandwidth. These degradations have a particularly serious impact on face videos because the human visual system is highly sensitive to facial details. Despite the significant advancement in video face enhancement, current methods still suffer from long processing time and inconsistent spatial-temporal visual effects (e.g., flickering). This study proposes a novel and efficient blind video face enhancement method to overcome the above two challenges, restoring high-quality videos from their compressed low-quality versions with an effective de-flickering mechanism. In…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace recognition and analysis · Face and Expression Recognition · Image and Video Stabilization
