Coherence Constraints in Facial Expression Recognition
Lisa Graziani, Stefano Melacci, Marco Gori

TL;DR
This paper explores the use of coherence constraints in training CNN-based facial expression recognizers, leveraging semi-supervised learning with video data to improve accuracy and robustness, especially under occlusions.
Contribution
It introduces coherence constraints in semi-supervised CNN training for facial expression recognition, enhancing performance by exploiting temporal and part-based prediction consistency.
Findings
Coherence constraints improve recognition accuracy.
Shape-based predictors outperform appearance-based ones with occlusions.
Semi-supervised training effectively utilizes unlabeled video data.
Abstract
Recognizing facial expressions from static images or video sequences is a widely studied but still challenging problem. The recent progresses obtained by deep neural architectures, or by ensembles of heterogeneous models, have shown that integrating multiple input representations leads to state-of-the-art results. In particular, the appearance and the shape of the input face, or the representations of some face parts, are commonly used to boost the quality of the recognizer. This paper investigates the application of Convolutional Neural Networks (CNNs) with the aim of building a versatile recognizer of expressions in static images that can be further applied to video sequences. We first study the importance of different face parts in the recognition task, focussing on appearance and shape-related features. Then we cast the learning problem in the Semi-Supervised setting, exploiting…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
