Most Important Person-guided Dual-branch Cross-Patch Attention for Group Affect Recognition
Hongxia Xie, Ming-Xian Lee, Tzu-Jui Chen, Hung-Jen Chen, Hou-I Liu,, Hong-Han Shuai, Wen-Huang Cheng

TL;DR
This paper introduces a novel dual-branch attention transformer that leverages the Most Important Person concept to improve group affect recognition by effectively integrating individual and global contextual cues.
Contribution
It proposes the Dual-branch Cross-Patch Attention Transformer (DCAT) that fuses MIP and global features, addressing the lack of affective context consideration in previous methods.
Findings
Outperforms state-of-the-art on multiple datasets
Effective in transfer to related tasks like group cohesion
Demonstrates the importance of MIP in affect recognition
Abstract
Group affect refers to the subjective emotion that is evoked by an external stimulus in a group, which is an important factor that shapes group behavior and outcomes. Recognizing group affect involves identifying important individuals and salient objects among a crowd that can evoke emotions. However, most existing methods lack attention to affective meaning in group dynamics and fail to account for the contextual relevance of faces and objects in group-level images. In this work, we propose a solution by incorporating the psychological concept of the Most Important Person (MIP), which represents the most noteworthy face in a crowd and has affective semantic meaning. We present the Dual-branch Cross-Patch Attention Transformer (DCAT) which uses global image and MIP together as inputs. Specifically, we first learn the informative facial regions produced by the MIP and the global context…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEmotion and Mood Recognition · Human Pose and Action Recognition · Anomaly Detection Techniques and Applications
MethodsMulti-Head Attention · Attention Is All You Need · Layer Normalization · Dropout · Byte Pair Encoding · Linear Layer · Dense Connections · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Residual Connection
