Pose-adaptive Hierarchical Attention Network for Facial Expression Recognition
Yuanyuan Liu, Jiyao Peng, Jiabei Zeng, and Shiguang Shan

TL;DR
This paper introduces PhaNet, a novel pose-adaptive hierarchical attention network that jointly recognizes facial expressions and poses, effectively handling pose variations in unconstrained environments through end-to-end training.
Contribution
PhaNet is the first end-to-end model that adaptively learns pose-invariant and expression-discriminative features using hierarchical attention, outperforming existing methods on multiple datasets.
Findings
Achieves state-of-the-art accuracy on multiple datasets.
Effectively handles pose variations without separate pose normalization.
Outperforms existing methods in both within-dataset and cross-dataset evaluations.
Abstract
Multi-view facial expression recognition (FER) is a challenging task because the appearance of an expression varies in poses. To alleviate the influences of poses, recent methods either perform pose normalization or learn separate FER classifiers for each pose. However, these methods usually have two stages and rely on good performance of pose estimators. Different from existing methods, we propose a pose-adaptive hierarchical attention network (PhaNet) that can jointly recognize the facial expressions and poses in unconstrained environment. Specifically, PhaNet discovers the most relevant regions to the facial expression by an attention mechanism in hierarchical scales, and the most informative scales are then selected to learn the pose-invariant and expression-discriminative representations. PhaNet is end-to-end trainable by minimizing the hierarchical attention losses, the FER loss…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEmotion and Mood Recognition · Face recognition and analysis · Hand Gesture Recognition Systems
