TL;DR
GIF is a novel generative model that combines 3D face modeling with 2D GANs to produce photo-realistic, controllable human face images with interpretable parameters, enabling semantic control over generated faces.
Contribution
We introduce GIF, a new generative model that conditions StyleGAN2 on FLAME parameters, achieving disentangled, controllable face generation with semantic interpretability.
Findings
GIF produces photo-realistic faces with controllable parameters.
Conditioning on rendered FLAME geometry improves generation quality.
Perceptual study confirms GIF's adherence to conditioning parameters.
Abstract
Photo-realistic visualization and animation of expressive human faces have been a long standing challenge. 3D face modeling methods provide parametric control but generates unrealistic images, on the other hand, generative 2D models like GANs (Generative Adversarial Networks) output photo-realistic face images, but lack explicit control. Recent methods gain partial control, either by attempting to disentangle different factors in an unsupervised manner, or by adding control post hoc to a pre-trained model. Unconditional GANs, however, may entangle factors that are hard to undo later. We condition our generative model on pre-defined control parameters to encourage disentanglement in the generation process. Specifically, we condition StyleGAN2 on FLAME, a generative 3D face model. While conditioning on FLAME parameters yields unsatisfactory results, we find that conditioning on rendered…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsPath Length Regularization · HuMan(Expedia)||How do I get a human at Expedia? · R1 Regularization · Convolution · Weight Demodulation · StyleGAN2
