Automated speech- and text-based classification of neuropsychiatric conditions in a multidiagnostic setting
Lasse Hansen, Roberta Rocca, Arndis Simonsen, Alberto Parola, Vibeke, Bliksted, Nicolai Ladegaard, Dan Bang, Kristian Tyl\'en, Ethan Weed, S{\o}ren, Dinesen {\O}stergaard, Riccardo Fusaroli

TL;DR
This study evaluates machine learning models, including Transformer-based approaches, for classifying neuropsychiatric conditions using speech and text features in multiclass settings, highlighting challenges and future directions.
Contribution
It introduces a multiclass dataset with multiple neuropsychiatric diagnoses and compares the performance of various models, emphasizing the need for larger, more detailed datasets.
Findings
Binary models perform well for single diagnoses
Multiclass classification accuracy drops significantly
Combining voice and text features improves performance
Abstract
Speech patterns have been identified as potential diagnostic markers for neuropsychiatric conditions. However, most studies only compare a single clinical group to healthy controls, whereas clinical practice often requires differentiating between multiple potential diagnoses (multiclass settings). To address this, we assembled a dataset of repeated recordings from 420 participants (67 with major depressive disorder, 106 with schizophrenia and 46 with autism, as well as matched controls), and tested the performance of a range of conventional machine learning models and advanced Transformer models on both binary and multiclass classification, based on voice and text features. While binary models performed comparably to previous research (F1 scores between 0.54-0.75 for autism spectrum disorder, ASD; 0.67-0.92 for major depressive disorder, MDD; and 0.71-0.83 for schizophrenia); when…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVoice and Speech Disorders · Stuttering Research and Treatment
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Position-Wise Feed-Forward Layer · Dropout · Softmax · Adam · Byte Pair Encoding · Residual Connection · Label Smoothing
