HSEmotion Team at the 7th ABAW Challenge: Multi-Task Learning and Compound Facial Expression Recognition
Andrey V. Savchenko

TL;DR
The HSEmotion team developed a multi-task learning pipeline using lightweight neural networks for facial expression and affect recognition, achieving significant accuracy improvements while maintaining privacy and enabling mobile deployment.
Contribution
We introduced an efficient, privacy-aware multi-task facial analysis pipeline with simple post-processing techniques that significantly enhance recognition accuracy.
Findings
F1-score improved by up to 7% with score smoothing.
Valence and arousal CCC increased by up to 1.25 times.
Performance score on validation set was 4.5 times higher than baseline.
Abstract
In this paper, we describe the results of the HSEmotion team in two tasks of the seventh Affective Behavior Analysis in-the-wild (ABAW) competition, namely, multi-task learning for simultaneous prediction of facial expression, valence, arousal, and detection of action units, and compound expression recognition. We propose an efficient pipeline based on frame-level facial feature extractors pre-trained in multi-task settings to estimate valence-arousal and basic facial expressions given a facial photo. We ensure the privacy-awareness of our techniques by using the lightweight architectures of neural networks, such as MT-EmotiDDAMFN, MT-EmotiEffNet, and MT-EmotiMobileFaceNet, that can run even on a mobile device without the need to send facial video to a remote server. It was demonstrated that a significant step in improving the overall accuracy is the smoothing of neural network output…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEmotion and Mood Recognition
MethodsSparse Evolutionary Training
