Facial Expression Analysis Using Decomposed Multiscale Spatiotemporal Networks
Wheidima Carneiro de Melo, Eric Granger, Miguel Bordallo Lopez

TL;DR
This paper introduces a decomposed multiscale spatiotemporal network (DMSN) for facial expression analysis, reducing computational complexity while effectively capturing diverse facial dynamics for health-related state inference.
Contribution
The paper proposes a novel DMSN architecture with three variants, enabling efficient multiscale feature extraction for facial expression analysis, adaptable to different facial behavior complexities.
Findings
DMSN-C is effective for depression detection.
DMSN-A is efficient for pain estimation.
DMSN offers a cost-effective solution for various facial expression complexities.
Abstract
Video-based analysis of facial expressions has been increasingly applied to infer health states of individuals, such as depression and pain. Among the existing approaches, deep learning models composed of structures for multiscale spatiotemporal processing have shown strong potential for encoding facial dynamics. However, such models have high computational complexity, making for a difficult deployment of these solutions. To address this issue, we introduce a new technique to decompose the extraction of multiscale spatiotemporal features. Particularly, a building block structure called Decomposed Multiscale Spatiotemporal Network (DMSN) is presented along with three variants: DMSN-A, DMSN-B, and DMSN-C blocks. The DMSN-A block generates multiscale representations by analyzing spatiotemporal features at multiple temporal ranges, while the DMSN-B block analyzes spatiotemporal features at…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEmotion and Mood Recognition · Face recognition and analysis · Face Recognition and Perception
