A multimodal Bayesian Network for symptom-level depression and anxiety prediction from voice and speech data
Agnes Norbury, George Fairs, Alexandra L. Georgescu, Matthew M. Nour, Emilia Molimpakis, and Stefano Goria

TL;DR
This paper presents a Bayesian network model that integrates voice and speech features to predict depression and anxiety symptoms, demonstrating high accuracy and fairness in large-scale datasets, aiming to support clinical psychiatric assessments.
Contribution
The study introduces a multimodal Bayesian network approach for symptom-level mental health prediction, addressing integration and interpretability challenges in clinical settings.
Findings
ROC-AUC for depression: 0.842
Model achieves high demographic fairness
Supports clinical supervision with explainable outputs
Abstract
During psychiatric assessment, clinicians observe not only what patients report, but important nonverbal signs such as tone, speech rate, fluency, responsiveness, and body language. Weighing and integrating these different information sources is a challenging task and a good candidate for support by intelligence-driven tools - however this is yet to be realized in the clinic. Here, we argue that several important barriers to adoption can be addressed using Bayesian network modelling. To demonstrate this, we evaluate a model for depression and anxiety symptom prediction from voice and speech features in large-scale datasets (30,135 unique speakers). Alongside performance for conditions and symptoms (for depression, anxiety ROC-AUC=0.842,0.831 ECE=0.018,0.015; core individual symptom ROC-AUC>0.74), we assess demographic fairness and investigate integration across and redundancy between…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMental Health via Writing · Machine Learning in Healthcare · Emotion and Mood Recognition
