Facial Expression Video Generation Based-On Spatio-temporal   Convolutional GAN: FEV-GAN

Hamza Bouzid; Lahoucine Ballihi

arXiv:2210.11182·cs.CV·July 4, 2023

Facial Expression Video Generation Based-On Spatio-temporal Convolutional GAN: FEV-GAN

Hamza Bouzid, Lahoucine Ballihi

PDF

TL;DR

This paper introduces FEV-GAN, a novel spatio-temporal convolutional GAN that generates facial expression videos from a single image, effectively preserving identity and improving video quality.

Contribution

The work proposes a dual-encoder generator architecture that leverages identity and spatial features to enhance facial expression video synthesis.

Findings

01

Effective generation of six basic facial expressions

02

High identity preservation in synthesized videos

03

Improved image quality over previous methods

Abstract

Facial expression generation has always been an intriguing task for scientists and researchers all over the globe. In this context, we present our novel approach for generating videos of the six basic facial expressions. Starting from a single neutral facial image and a label indicating the desired facial expression, we aim to synthesize a video of the given identity performing the specified facial expression. Our approach, referred to as FEV-GAN (Facial Expression Video GAN), is based on Spatio-temporal Convolutional GANs, that are known to model both content and motion in the same network. Previous methods based on such a network have shown a good ability to generate coherent videos with smooth temporal evolution. However, they still suffer from low image quality and low identity preservation capability. In this work, we address this problem by using a generator composed of two image…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.