Efficient Video Face Enhancement with Enhanced Spatial-Temporal   Consistency

Yutong Wang; Jiajie Teng; Jiajiong Cao; Yuming Li; Chenguang Ma,; Hongteng Xu; Dixin Luo

arXiv:2411.16468·cs.CV·November 26, 2024

Efficient Video Face Enhancement with Enhanced Spatial-Temporal Consistency

Yutong Wang, Jiajie Teng, Jiajiong Cao, Yuming Li, Chenguang Ma,, Hongteng Xu, Dixin Luo

PDF

Open Access 1 Repo

TL;DR

This paper introduces an efficient blind video face enhancement method that improves quality and temporal consistency of compressed face videos using a novel 3D-VQGAN-based framework with a de-flickering mechanism.

Contribution

The paper proposes a new 3D-VQGAN-based framework with a two-stage learning process and de-flickering for improved face video enhancement, surpassing current methods in efficiency and quality.

Findings

01

Outperforms state-of-the-art methods on VFHQ-Test dataset

02

Reduces flickering and improves visual quality of face videos

03

Achieves faster processing times compared to existing approaches

Abstract

As a very common type of video, face videos often appear in movies, talk shows, live broadcasts, and other scenes. Real-world online videos are often plagued by degradations such as blurring and quantization noise, due to the high compression ratio caused by high communication costs and limited transmission bandwidth. These degradations have a particularly serious impact on face videos because the human visual system is highly sensitive to facial details. Despite the significant advancement in video face enhancement, current methods still suffer from $i)$ long processing time and $ii)$ inconsistent spatial-temporal visual effects (e.g., flickering). This study proposes a novel and efficient blind video face enhancement method to overcome the above two challenges, restoring high-quality videos from their compressed low-quality versions with an effective de-flickering mechanism. In…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

dixin-lab/bfvr-stc
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFace recognition and analysis · Face and Expression Recognition · Image and Video Stabilization