FacialPulse: An Efficient RNN-based Depression Detection via Temporal Facial Landmarks
Ruiqi Wang, Jinyang Huang, Jie Zhang, Xin Liu, Xiang Zhang, Zhi Liu,, Peng Zhao, Sigui Chen, Xiao Sun

TL;DR
FacialPulse is a novel RNN-based framework that efficiently detects depression by capturing temporal facial dynamics and landmarks, outperforming existing methods in accuracy and speed.
Contribution
The paper introduces FacialPulse, combining a new temporal facial motion module and landmark calibration to improve depression detection accuracy and computational efficiency.
Findings
Decreased MAE by 21% over baselines
Doubled recognition speed compared to state-of-the-art
Effective use of facial landmarks reduces redundancy
Abstract
Depression is a prevalent mental health disorder that significantly impacts individuals' lives and well-being. Early detection and intervention are crucial for effective treatment and management of depression. Recently, there are many end-to-end deep learning methods leveraging the facial expression features for automatic depression detection. However, most current methods overlook the temporal dynamics of facial expressions. Although very recent 3DCNN methods remedy this gap, they introduce more computational cost due to the selection of CNN-based backbones and redundant facial features. To address the above limitations, by considering the timing correlation of facial expressions, we propose a novel framework called FacialPulse, which recognizes depression with high accuracy and speed. By harnessing the bidirectional nature and proficiently addressing long-term dependencies, the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEmotion and Mood Recognition · Face recognition and analysis · Mental Health via Writing
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Masked autoencoder
