Region Attention Networks for Pose and Occlusion Robust Facial Expression Recognition
Kai Wang, Xiaojiang Peng, Jianfei Yang, Debin Meng, Yu Qiao

TL;DR
This paper introduces a Region Attention Network (RAN) that adaptively focuses on important facial regions to improve facial expression recognition under occlusion and pose variations, validated on new and existing datasets.
Contribution
The paper presents a novel RAN architecture and a region biased loss, along with new in-the-wild datasets, to enhance FER robustness against occlusion and pose changes.
Findings
RAN significantly improves FER accuracy under occlusion and pose variations.
The region biased loss effectively emphasizes key facial regions for expression recognition.
State-of-the-art performance achieved on multiple FER datasets.
Abstract
Occlusion and pose variations, which can change facial appearance significantly, are two major obstacles for automatic Facial Expression Recognition (FER). Though automatic FER has made substantial progresses in the past few decades, occlusion-robust and pose-invariant issues of FER have received relatively less attention, especially in real-world scenarios. This paper addresses the real-world pose and occlusion robust FER problem with three-fold contributions. First, to stimulate the research of FER under real-world occlusions and variant poses, we build several in-the-wild facial expression datasets with manual annotations for the community. Second, we propose a novel Region Attention Network (RAN), to adaptively capture the importance of facial regions for occlusion and pose variant FER. The RAN aggregates and embeds varied number of region features produced by a backbone…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEmotion and Mood Recognition · Face recognition and analysis · Face and Expression Recognition
MethodsAverage Pooling · Residual Connection · *Communicated@Fast*How Do I Communicate to Expedia? · 1x1 Convolution · Batch Normalization · Bottleneck Residual Block · Global Average Pooling · Residual Block · Kaiming Initialization · Max Pooling
