Cross-subject Action Unit Detection with Meta Learning and Transformer-based Relation Modeling
Jiyuan Cao, Zhilei Liu, Yong Zhang

TL;DR
This paper introduces a novel meta-learning and transformer-based approach for cross-subject facial Action Unit detection, effectively reducing identity bias and capturing AU relationships to improve accuracy on public datasets.
Contribution
The paper proposes a meta-learning-based AU representation learning method combined with transformer-based relation modeling, enhancing cross-subject AU detection accuracy.
Findings
Improved F1 scores by 1.3% on BP4D dataset
Achieved better generalization across subjects
Outperformed state-of-the-art methods
Abstract
Facial Action Unit (AU) detection is a crucial task for emotion analysis from facial movements. The apparent differences of different subjects sometimes mislead changes brought by AUs, resulting in inaccurate results. However, most of the existing AU detection methods based on deep learning didn't consider the identity information of different subjects. The paper proposes a meta-learning-based cross-subject AU detection model to eliminate the identity-caused differences. Besides, a transformer-based relation learning module is introduced to learn the latent relations of multiple AUs. To be specific, our proposed work is composed of two sub-tasks. The first sub-task is meta-learning-based AU local region representation learning, called MARL, which learns discriminative representation of local AU regions that incorporates the shared information of multiple subjects and eliminates…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEmotion and Mood Recognition · Gaze Tracking and Assistive Technology · Advanced Computing and Algorithms
