Deep Spatiotemporal Representation of the Face for Automatic Pain Intensity Estimation
Mohammad Tavakolian, Abdenour Hadid

TL;DR
This paper introduces a novel 3D deep learning model that captures complex spatiotemporal facial variations for automatic pain intensity estimation, demonstrating promising results on a standard dataset.
Contribution
The paper presents a new 3D deep architecture with variable temporal depths to improve facial pain intensity assessment from spontaneous expressions.
Findings
Achieved promising performance on UNBC-McMaster dataset
Effectively captures a wide range of facial spatiotemporal variations
Outperforms existing methods in automatic pain estimation
Abstract
Automatic pain intensity assessment has a high value in disease diagnosis applications. Inspired by the fact that many diseases and brain disorders can interrupt normal facial expression formation, we aim to develop a computational model for automatic pain intensity assessment from spontaneous and micro facial variations. For this purpose, we propose a 3D deep architecture for dynamic facial video representation. The proposed model is built by stacking several convolutional modules where each module encompasses a 3D convolution kernel with a fixed temporal depth, several parallel 3D convolutional kernels with different temporal depths, and an average pooling layer. Deploying variable temporal depths in the proposed architecture allows the model to effectively capture a wide range of spatiotemporal variations on the faces. Extensive experiments on the UNBC-McMaster Shoulder Pain…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Methods3D Convolution · Average Pooling · Convolution
