Efficient Neural Architecture Search for Emotion Recognition
Monu Verma, Murari Mandal, Satish Kumar Reddy, Yashwanth Reddy, Meedimale, Santosh Kumar Vipparthi

TL;DR
This paper introduces EmoNAS, a neural architecture search method that designs lightweight, efficient models for both macro and micro facial expression recognition, outperforming existing methods across multiple datasets.
Contribution
First NAS-based approach for simultaneous macro and micro-expression recognition, producing robust, lightweight models using gradient-based architecture search.
Findings
Models outperform state-of-the-art methods on 13 datasets.
Achieves high speed and low space complexity.
Effective in recognizing both macro and micro facial expressions.
Abstract
Automated human emotion recognition from facial expressions is a well-studied problem and still remains a very challenging task. Some efficient or accurate deep learning models have been presented in the literature. However, it is quite difficult to design a model that is both efficient and accurate at the same time. Moreover, identifying the minute feature variations in facial regions for both macro and micro-expressions requires expertise in network design. In this paper, we proposed to search for a highly efficient and robust neural architecture for both macro and micro-level facial expression recognition. To the best of our knowledge, this is the first attempt to design a NAS-based solution for both macro and micro-expression recognition. We produce lightweight models with a gradient-based architecture search algorithm. To maintain consistency between macro and micro-expressions, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEmotion and Mood Recognition · Advanced Computing and Algorithms · Face and Expression Recognition
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
