HyperLips: Hyper Control Lips with High Resolution Decoder for Talking   Face Generation

Yaosen Chen; Yu Yao; Zhiqiang Li; Wei Wang; Yanru Zhang; Han Yang,; Xuming Wen

arXiv:2310.05720·cs.CV·October 17, 2023·1 cites

HyperLips: Hyper Control Lips with High Resolution Decoder for Talking Face Generation

Yaosen Chen, Yu Yao, Zhiqiang Li, Wei Wang, Yanru Zhang, Han Yang,, Xuming Wen

PDF

Open Access 1 Repo

TL;DR

HyperLips is a two-stage framework that enhances talking face generation by controlling lip movements with a hypernetwork and producing high-resolution, realistic videos with a dedicated decoder, outperforming existing methods.

Contribution

The paper introduces HyperLips, a novel two-stage approach combining a hypernetwork for lip control and a high-resolution decoder for improved visual quality in talking face generation.

Findings

01

Outperforms state-of-the-art methods in realism and lip synchronization.

02

Produces high-fidelity facial videos with better visual quality.

03

Demonstrates effectiveness through extensive experiments.

Abstract

Talking face generation has a wide range of potential applications in the field of virtual digital humans. However, rendering high-fidelity facial video while ensuring lip synchronization is still a challenge for existing audio-driven talking face generation approaches. To address this issue, we propose HyperLips, a two-stage framework consisting of a hypernetwork for controlling lips and a high-resolution decoder for rendering high-fidelity faces. In the first stage, we construct a base face generation network that uses the hypernetwork to control the encoding latent code of the visual face information over audio. First, FaceEncoder is used to obtain latent code by extracting features from the visual face information taken from the video source containing the face frame.Then, HyperConv, which weighting parameters are updated by HyperNet with the audio features as input, will modify the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

semchan/HyperLips
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFace recognition and analysis · Generative Adversarial Networks and Image Synthesis

MethodsBalanced Selection · HyperNetwork