Foundation Model-based Evaluation of Neuropsychiatric Disorders: A Lifespan-Inclusive, Multi-Modal, and Multi-Lingual Study
Zhongren Dong, Haotian Guo, Weixiang Xu, Huan Zhao, Zixing Zhang

TL;DR
This study introduces FEND, a comprehensive multi-modal framework leveraging speech and text data across multiple languages and ages to evaluate neuropsychiatric disorders, highlighting its strengths and limitations.
Contribution
FEND is the first unified, lifespan-inclusive, multi-lingual framework for neuropsychiatric disorder detection using multi-modal data, with extensive benchmarking and analysis.
Findings
Multi-modal fusion improves detection for AD and depression.
Fusion underperforms in ASD due to dataset heterogeneity.
Modality imbalance often limits multi-modal performance.
Abstract
Neuropsychiatric disorders, such as Alzheimer's disease (AD), depression, and autism spectrum disorder (ASD), are characterized by linguistic and acoustic abnormalities, offering potential biomarkers for early detection. Despite the promise of multi-modal approaches, challenges like multi-lingual generalization and the absence of a unified evaluation framework persist. To address these gaps, we propose FEND (Foundation model-based Evaluation of Neuropsychiatric Disorders), a comprehensive multi-modal framework integrating speech and text modalities for detecting AD, depression, and ASD across the lifespan. Leveraging 13 multi-lingual datasets spanning English, Chinese, Greek, French, and Dutch, we systematically evaluate multi-modal fusion performance. Our results show that multi-modal fusion excels in AD and depression detection but underperforms in ASD due to dataset heterogeneity. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMental Health via Writing · Emotion and Mood Recognition · Neurobiology of Language and Bilingualism
