Sparse to Dense Dynamic 3D Facial Expression Generation
Naima Otberdout, Claudio Ferrari, Mohamed Daoudi, Stefano, Berretti, Alberto Del Bimbo

TL;DR
This paper introduces a novel method for generating realistic dynamic 3D facial expressions from a neutral face and an expression label, using a combination of landmark-based motion modeling and mesh deformation techniques.
Contribution
It presents a new framework combining Motion3DGAN and S2D-Dec for disentangling identity and expression, improving dynamic 3D facial expression synthesis and mesh reconstruction.
Findings
Significant improvements over previous methods on CoMA and D3DFACS datasets.
Effective disentanglement of identity and expression in 3D face modeling.
Good generalization to unseen data.
Abstract
In this paper, we propose a solution to the task of generating dynamic 3D facial expressions from a neutral 3D face and an expression label. This involves solving two sub-problems: (i)modeling the temporal dynamics of expressions, and (ii) deforming the neutral mesh to obtain the expressive counterpart. We represent the temporal evolution of expressions using the motion of a sparse set of 3D landmarks that we learn to generate by training a manifold-valued GAN (Motion3DGAN). To better encode the expression-induced deformation and disentangle it from the identity information, the generated motion is represented as per-frame displacement from a neutral configuration. To generate the expressive meshes, we train a Sparse2Dense mesh Decoder (S2D-Dec) that maps the landmark displacements to a dense, per-vertex displacement. This allows us to learn how the motion of a sparse set of landmarks…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
