TL;DR
This paper introduces a novel graph-based method that explicitly models complex relationships between facial action units using multi-dimensional edge features, significantly improving recognition accuracy.
Contribution
It proposes a deep learning approach that encodes AU relationships with unique graphs and multi-dimensional edges, enhancing AU recognition performance.
Findings
Achieves state-of-the-art results on BP4D and DISFA datasets.
Improves performance of CNN and transformer backbones with the proposed modules.
Demonstrates strong modeling of AU relationship cues for recognition.
Abstract
The activations of Facial Action Units (AUs) mutually influence one another. While the relationship between a pair of AUs can be complex and unique, existing approaches fail to specifically and explicitly represent such cues for each pair of AUs in each facial display. This paper proposes an AU relationship modelling approach that deep learns a unique graph to explicitly describe the relationship between each pair of AUs of the target facial display. Our approach first encodes each AU's activation status and its association with other AUs into a node feature. Then, it learns a pair of multi-dimensional edge features to describe multiple task-specific relationship cues between each pair of AUs. During both node and edge feature learning, our approach also considers the influence of the unique facial display on AUs' relationship by taking the full face representation as an input.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
