MD-Face: MoE-Enhanced Label-Free Disentangled Representation for Interactive Facial Attribute Editing
Xuan Cui, Yunfei Zhao, Bo Liu, Wei Duan, Xingrong Fan

TL;DR
MD-Face introduces a label-free, MoE-based framework for disentangled facial attribute editing, reducing attribute entanglement without requiring labeled data and enabling efficient, high-quality interactive editing.
Contribution
It proposes a novel MoE-based disentangled representation learning method with geometry-aware loss, achieving competitive results without labeled data.
Findings
Outperforms unsupervised baselines in attribute disentanglement.
Achieves comparable performance to supervised methods.
Offers better image quality and lower latency than diffusion-based approaches.
Abstract
GAN-based facial attribute editing is widely used in virtual avatars and social media but often suffers from attribute entanglement, where modifying one face attribute unintentionally alters others. While supervised disentangled representation learning can address this, it relies heavily on labeled data, incurring high annotation costs. To address these challenges, we propose MD-Face, a label-free disentangled representation learning framework based on Mixture of Experts (MoE). MD-Face utilizes a MoE backbone with a gating mechanism that dynamically allocates experts, enabling the model to learn semantic vectors with greater independence. To further enhance attribute entanglement, we introduce a geometry-aware loss, which aligns each semantic vector with its corresponding Semantic Boundary Vector (SBV) through a Jacobian-based pushforward method. Experiments with ProGAN and StyleGAN…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
