Multi-Task Learning Framework for Emotion Recognition in-the-wild
Tenggan Zhang, Chuanhe Liu, Xiaolong Liu, Yuchen Liu, Liyu Meng, Lei, Sun, Wenqiang Jiang, Fengyuan Zhang, Jinming Zhao, Qin Jin

TL;DR
This paper introduces a multi-task learning system for emotion recognition in videos, combining unsupervised and supervised visual feature extraction with various temporal encoders to improve affective analysis.
Contribution
The paper proposes a novel multi-task learning framework integrating multiple feature extraction and temporal encoding methods for affective behavior analysis.
Findings
Achieved top performance in the MTL Challenge with a score of 1.7607 on validation.
Effectively modeled correlations between valence, arousal, expression, and AU tasks.
Demonstrated the effectiveness of combining unsupervised and supervised learning for visual features.
Abstract
This paper presents our system for the Multi-Task Learning (MTL) Challenge in the 4th Affective Behavior Analysis in-the-wild (ABAW) competition. We explore the research problems of this challenge from three aspects: 1) For obtaining efficient and robust visual feature representations, we propose MAE-based unsupervised representation learning and IResNet/DenseNet-based supervised representation learning methods; 2) Considering the importance of temporal information in videos, we explore three types of sequential encoders to capture the temporal information, including the encoder based on transformer, the encoder based on LSTM, and the encoder based on GRU; 3) For modeling the correlation between these different tasks (i.e., valence, arousal, expression, and AU) for multi-task affective analysis, we first explore the dependency between these different tasks and propose three multi-task…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEmotion and Mood Recognition
MethodsTest · Tanh Activation · Sigmoid Activation · Long Short-Term Memory · Gated Recurrent Unit
