MotivNet: Evolving Meta-Sapiens into an Emotionally Intelligent Foundation Model
Rahul Medicharla, Alper Yilmaz

TL;DR
MotivNet is a facial emotion recognition model that leverages a foundational vision model, Sapiens, to achieve robust, cross-domain performance without the need for cross-domain training, advancing real-world FER applications.
Contribution
This paper introduces MotivNet as a novel downstream task for Sapiens, enabling generalizable FER without cross-domain training, validated through multiple evaluation criteria.
Findings
MotivNet achieves competitive performance across diverse datasets.
It demonstrates strong generalization in real-world scenarios.
MotivNet is validated as a viable Sapiens downstream task.
Abstract
In this paper, we introduce MotivNet, a generalizable facial emotion recognition model for robust real-world application. Current state-of-the-art FER models tend to have weak generalization when tested on diverse data, leading to deteriorated performance in the real world and hindering FER as a research domain. Though researchers have proposed complex architectures to address this generalization issue, they require training cross-domain to obtain generalizable results, which is inherently contradictory for real-world application. Our model, MotivNet, achieves competitive performance across datasets without cross-domain training by using Meta-Sapiens as a backbone. Sapiens is a human vision foundational model with state-of-the-art generalization in the real world through large-scale pretraining of a Masked Autoencoder. We propose MotivNet as an additional downstream task for Sapiens and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEmotion and Mood Recognition · Face recognition and analysis · Face and Expression Recognition
