TL;DR
This paper introduces a novel generative model that disentangles and controls facial attributes like gaze and head orientation, improving image synthesis and downstream gaze estimation, especially with limited real data.
Contribution
The paper presents a new architecture for disentangling appearance factors in face images, enhancing control over gaze and head orientation for better synthesis and estimation.
Findings
Outperforms state-of-the-art in redirection accuracy
Improves disentanglement of gaze and head orientation
Enhances semi-supervised gaze estimation with limited data
Abstract
Many computer vision tasks rely on labeled data. Rapid progress in generative modeling has led to the ability to synthesize photorealistic images. However, controlling specific aspects of the generation process such that the data can be used for supervision of downstream tasks remains challenging. In this paper we propose a novel generative model for images of faces, that is capable of producing high-quality images under fine-grained control over eye gaze and head orientation angles. This requires the disentangling of many appearance related factors including gaze and head orientation but also lighting, hue etc. We propose a novel architecture which learns to discover, disentangle and encode these extraneous variations in a self-learned manner. We further show that explicitly disentangling task-irrelevant factors results in more accurate modelling of gaze and head orientation. A novel…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
MethodsSelf-Learning
