Identifying depression-related topics in smartphone-collected free-response speech recordings using an automatic speech recognition system and a deep learning topic model
Yuezhou Zhang, Amos A Folarin, Judith Dineley, Pauline Conde, Valeria, de Angel, Shaoxiong Sun, Yatharth Ranjan, Zulqarnain Rashid, Callum Stewart,, Petroula Laiou, Heet Sankesara, Linglong Qian, Faith Matcham, Katie M White,, Carolin Oetzmann, Femke Lamers, Sara Siddi

TL;DR
This study uses speech recordings and deep learning to identify language patterns associated with depression, demonstrating that specific topics in speech can serve as indicators of depression severity in real-world data.
Contribution
The paper introduces a novel workflow combining automatic speech recognition and deep learning topic modeling to analyze large-scale speech data for depression indicators.
Findings
Six depression risk topics identified in speech data
Correlation between topic shifts and depression severity over time
Consistent results across different datasets
Abstract
Language use has been shown to correlate with depression, but large-scale validation is needed. Traditional methods like clinic studies are expensive. So, natural language processing has been employed on social media to predict depression, but limitations remain-lack of validated labels, biased user samples, and no context. Our study identified 29 topics in 3919 smartphone-collected speech recordings from 265 participants using the Whisper tool and BERTopic model. Six topics with a median PHQ-8 greater than or equal to 10 were regarded as risk topics for depression: No Expectations, Sleep, Mental Therapy, Haircut, Studying, and Coursework. To elucidate the topic emergence and associations with depression, we compared behavioral (from wearables) and linguistic characteristics across identified topics. The correlation between topic shifts and changes in depression severity over time was…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMental Health via Writing · Digital Mental Health Interventions · Mental Health Research Topics
