TL;DR
This paper introduces a new dataset and a multi-modal deep learning approach to automatically detect psychological distress through body gestures, self-adaptors, and audio-visual cues, advancing research in non-verbal distress analysis.
Contribution
It presents a novel dataset of full body videos with distress labels and a new method for detecting self-adaptors and fidgeting, improving automatic distress prediction.
Findings
The proposed model accurately predicts distress levels using combined audio-visual and behavioral features.
Automatic detection of fidgeting correlates with psychological distress.
Multi-modal deep learning outperforms single-modality approaches.
Abstract
Psychological distress is a significant and growing issue in society. Automatic detection, assessment, and analysis of such distress is an active area of research. Compared to modalities such as face, head, and vocal, research investigating the use of the body modality for these tasks is relatively sparse. This is, in part, due to the limited available datasets and difficulty in automatically extracting useful body features. Recent advances in pose estimation and deep learning have enabled new approaches to this modality and domain. To enable this research, we have collected and analyzed a new dataset containing full body videos for short interviews and self-reported distress labels. We propose a novel method to automatically detect self-adaptors and fidgeting, a subset of self-adaptors that has been shown to be correlated with psychological distress. We perform analysis on statistical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
