ARPGNet: Appearance- and Relation-aware Parallel Graph Attention Fusion Network for Facial Expression Recognition
Yan Li, Yong Zhao, Xiaohan Xia, Dongmei Jiang

TL;DR
ARPGNet introduces a novel graph attention fusion network that effectively models facial region relationships and appearance dynamics to improve facial expression recognition accuracy.
Contribution
This paper proposes a new parallel graph attention fusion network that jointly learns appearance and relation features for facial expression recognition.
Findings
ARPGNet outperforms existing methods on three datasets.
The relational graph modeling enhances expression recognition accuracy.
Parallel fusion of appearance and relation features improves robustness.
Abstract
The key to facial expression recognition is to learn discriminative spatial-temporal representations that embed facial expression dynamics. Previous studies predominantly rely on pre-trained Convolutional Neural Networks (CNNs) to learn facial appearance representations, overlooking the relationships between facial regions. To address this issue, this paper presents an Appearance- and Relation-aware Parallel Graph attention fusion Network (ARPGNet) to learn mutually enhanced spatial-temporal representations of appearance and relation information. Specifically, we construct a facial region relation graph and leverage the graph attention mechanism to model the relationships between facial regions. The resulting relational representation sequences, along with CNN-based appearance representation sequences, are then fed into a parallel graph attention fusion module for mutual interaction and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEmotion and Mood Recognition · Face recognition and analysis · Face Recognition and Perception
