Efficient 3D-Aware Facial Image Editing via Attribute-Specific Prompt Learning
Amandeep Kumar, Muhammad Awais, Sanath Narayan, Hisham, Cholakkal, Salman Khan, Rao Muhammad Anwer

TL;DR
This paper introduces a scalable, text-guided 3D-aware facial image editing framework that uses attribute-specific prompt learning to control facial attributes across various poses, ensuring view consistency and identity preservation.
Contribution
It proposes a novel, plug-and-play, text-driven style token-based latent attribute editor (LAE) that enables scalable, attribute-specific editing in 3D-aware GANs using a pre-trained vision-language model.
Findings
High-quality, view-consistent facial image editing across multiple attributes.
Effective attribute control using text-guided prompts in 3D-aware face generation.
Scalable approach avoiding separate models for each attribute.
Abstract
Drawing upon StyleGAN's expressivity and disentangled latent space, existing 2D approaches employ textual prompting to edit facial images with different attributes. In contrast, 3D-aware approaches that generate faces at different target poses require attribute-specific classifiers, learning separate model weights for each attribute, and are not scalable for novel attributes. In this work, we propose an efficient, plug-and-play, 3D-aware face editing framework based on attribute-specific prompt learning, enabling the generation of facial images with controllable attributes across various target poses. To this end, we introduce a text-driven learnable style token-based latent attribute editor (LAE). The LAE harnesses a pre-trained vision-language model to find text-guided attribute-specific editing direction in the latent space of any pre-trained 3D-aware GAN. It utilizes learnable style…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace recognition and analysis · Facial Nerve Paralysis Treatment and Research · Face and Expression Recognition
