Representation Learning and Identity Adversarial Training for Facial Behavior Understanding
Mang Ning, Albert Ali Salah, Itir Onal Ertugrul

TL;DR
This paper introduces Face9M, a large facial image dataset, and proposes a novel identity adversarial training method combined with a masked autoencoder to improve facial action unit detection, achieving state-of-the-art results.
Contribution
The paper presents Face9M dataset, a new masked autoencoder model, and a novel identity adversarial training approach for more accurate and identity-invariant facial behavior understanding.
Findings
Face9M dataset improves AU detection performance.
IAT regularization reduces identity bias in models.
FMAE-IAT achieves state-of-the-art F1 scores on multiple datasets.
Abstract
Facial Action Unit (AU) detection has gained significant attention as it enables the breakdown of complex facial expressions into individual muscle movements. In this paper, we revisit two fundamental factors in AU detection: diverse and large-scale data and subject identity regularization. Motivated by recent advances in foundation models, we highlight the importance of data and introduce Face9M, a diverse dataset comprising 9 million facial images from multiple public sources. Pretraining a masked autoencoder on Face9M yields strong performance in AU detection and facial expression tasks. More importantly, we emphasize that the Identity Adversarial Training (IAT) has not been well explored in AU tasks. To fill this gap, we first show that subject identity in AU datasets creates shortcut learning for the model and leads to sub-optimal solutions to AU predictions. Secondly, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace recognition and analysis · Face and Expression Recognition
MethodsSoftmax · Attention Is All You Need
