Beyond Alignment: Blind Video Face Restoration via Parsing-Guided   Temporal-Coherent Transformer

Kepeng Xu; Li Xu; Gang He; Wenxin Yu; Yunsong Li

arXiv:2404.13640·cs.MM·April 23, 2024

Beyond Alignment: Blind Video Face Restoration via Parsing-Guided Temporal-Coherent Transformer

Kepeng Xu, Li Xu, Gang He, Wenxin Yu, Yunsong Li

PDF

Open Access 1 Repo

TL;DR

This paper introduces PGTFormer, a novel blind video face restoration method that uses semantic parsing guidance and a transformer architecture to produce temporally coherent, high-quality face videos without pre-alignment.

Contribution

It presents the first blind video face restoration approach with a parsing-guided transformer that avoids pre-alignment and reduces artifacts, improving temporal coherence and fidelity.

Findings

01

Outperforms previous face restoration methods in experiments.

02

Effectively reduces artifacts and jitter in restored videos.

03

Enhances temporal consistency without face pre-alignment.

Abstract

Multiple complex degradations are coupled in low-quality video faces in the real world. Therefore, blind video face restoration is a highly challenging ill-posed problem, requiring not only hallucinating high-fidelity details but also enhancing temporal coherence across diverse pose variations. Restoring each frame independently in a naive manner inevitably introduces temporal incoherence and artifacts from pose changes and keypoint localization errors. To address this, we propose the first blind video face restoration approach with a novel parsing-guided temporal-coherent transformer (PGTFormer) without pre-alignment. PGTFormer leverages semantic parsing guidance to select optimal face priors for generating temporally coherent artifact-free results. Specifically, we pre-train a temporal-spatial vector quantized auto-encoder on high-quality video face datasets to extract expressive…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

kepengxu/pgtformer
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFace recognition and analysis · Advanced Image Processing Techniques · Biometric Identification and Security