Self-Supervised Embeddings for Detecting Individual Symptoms of Depression
Sri Harsha Dumpala, Katerina Dikaios, Abraham Nunes, Frank Rudzicz,, Rudolf Uher, Sageev Oore

TL;DR
This paper introduces a method using self-supervised speech embeddings to detect individual depression symptoms and severity, outperforming traditional features, and emphasizes the importance of multi-task learning and diverse SSL models.
Contribution
It is the first to utilize SSL embeddings for simultaneous depression symptom detection and severity prediction, exploring various SSL models and multi-task learning strategies.
Findings
SSL embeddings outperform conventional speech features
Combining multiple SSL models enhances detection accuracy
Multi-task learning improves symptom identification
Abstract
Depression, a prevalent mental health disorder impacting millions globally, demands reliable assessment systems. Unlike previous studies that focus solely on either detecting depression or predicting its severity, our work identifies individual symptoms of depression while also predicting its severity using speech input. We leverage self-supervised learning (SSL)-based speech models to better utilize the small-sized datasets that are frequently encountered in this task. Our study demonstrates notable performance improvements by utilizing SSL embeddings compared to conventional speech features. We compare various types of SSL pretrained models to elucidate the type of speech information (semantic, speaker, or prosodic) that contributes the most in identifying different symptoms. Additionally, we evaluate the impact of combining multiple SSL embeddings on performance. Furthermore, we show…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMental Health Research Topics
MethodsFocus
