Facial Expression Video Generation Based-On Spatio-temporal Convolutional GAN: FEV-GAN
Hamza Bouzid, Lahoucine Ballihi

TL;DR
This paper introduces FEV-GAN, a novel spatio-temporal convolutional GAN that generates facial expression videos from a single image, effectively preserving identity and improving video quality.
Contribution
The work proposes a dual-encoder generator architecture that leverages identity and spatial features to enhance facial expression video synthesis.
Findings
Effective generation of six basic facial expressions
High identity preservation in synthesized videos
Improved image quality over previous methods
Abstract
Facial expression generation has always been an intriguing task for scientists and researchers all over the globe. In this context, we present our novel approach for generating videos of the six basic facial expressions. Starting from a single neutral facial image and a label indicating the desired facial expression, we aim to synthesize a video of the given identity performing the specified facial expression. Our approach, referred to as FEV-GAN (Facial Expression Video GAN), is based on Spatio-temporal Convolutional GANs, that are known to model both content and motion in the same network. Previous methods based on such a network have shown a good ability to generate coherent videos with smooth temporal evolution. However, they still suffer from low image quality and low identity preservation capability. In this work, we address this problem by using a generator composed of two image…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
