When Facial Expression Recognition Meets Few-Shot Learning: A Joint and Alternate Learning Framework
Xinyi Zou, Yan Yan, Jing-Hao Xue, Si Chen, Hanzi Wang

TL;DR
This paper introduces a novel two-stage learning framework called EGS-Net for few-shot compound facial expression recognition, leveraging emotion guidance to improve generalization to unseen expressions.
Contribution
The paper proposes EGS-Net, a joint and alternating learning framework that enhances few-shot compound FER by preventing overfitting and improving inference on unseen expressions.
Findings
EGS-Net outperforms state-of-the-art methods on multiple datasets.
Emotion guidance effectively regularizes the similarity learning.
Two-stage training improves generalization to novel expressions.
Abstract
Human emotions involve basic and compound facial expressions. However, current research on facial expression recognition (FER) mainly focuses on basic expressions, and thus fails to address the diversity of human emotions in practical scenarios. Meanwhile, existing work on compound FER relies heavily on abundant labeled compound expression training data, which are often laboriously collected under the professional instruction of psychology. In this paper, we study compound FER in the cross-domain few-shot learning setting, where only a few images of novel classes from the target domain are required as a reference. In particular, we aim to identify unseen compound expressions with the model trained on easily accessible basic expression datasets. To alleviate the problem of limited base classes in our FER task, we propose a novel Emotion Guided Similarity Network (EGS-Net), consisting of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsEmotion and Mood Recognition · Face and Expression Recognition · Machine Learning and ELM
MethodsBalanced Selection
