Large Language Models for Depression Recognition in Spoken Language Integrating Psychological Knowledge

Yupei Li, Shuaijie Shao, Manuel Milling, and Björn W. Schuller

arXiv:2505.22863 · cs.HC · August 27, 2025

TL;DR

This paper introduces a multimodal depression detection method combining large language models with audio features and psychological knowledge, improving diagnostic accuracy over previous approaches.

Contribution

To the best of the authors' knowledge, this is the first work to apply LLMs to multimodal depression detection, combining audio features with psychological expertise to improve accuracy.

Findings

01  Improved MAE and RMSE in depression detection

02  Effective integration of psychological knowledge into LLMs

03  Utilization of Wav2Vec for audio feature extraction
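
MAE and RMSE, the two error metrics named in the first finding, can be computed as in the minimal sketch below. The function names and the example scores are illustrative assumptions; they are not taken from the paper or its repository, and the numbers are made up purely to show the arithmetic.

```python
import math
from typing import Sequence


def mean_absolute_error(y_true: Sequence[float], y_pred: Sequence[float]) -> float:
    """Average absolute difference between true and predicted scores."""
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


def root_mean_squared_error(y_true: Sequence[float], y_pred: Sequence[float]) -> float:
    """Square root of the average squared difference between scores."""
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    mse = sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)
    return math.sqrt(mse)


# Hypothetical depression-severity scores (illustrative only, not from the paper).
true_scores = [4.0, 10.0, 16.0, 2.0]
pred_scores = [6.0, 9.0, 12.0, 2.0]

mae = mean_absolute_error(true_scores, pred_scores)
rmse = root_mean_squared_error(true_scores, pred_scores)
print(f"MAE={mae:.3f} RMSE={rmse:.3f}")
```

Lower is better for both metrics. RMSE weights large errors more heavily than MAE, so on the same predictions RMSE is never smaller than MAE.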

Abstract

Depression is a growing concern gaining attention in both public discourse and AI research. While deep neural networks (DNNs) have been used for recognition, they still lack real-world effectiveness. Large language models (LLMs) show strong potential but require domain-specific fine-tuning and struggle with non-textual cues. Since depression is often expressed through vocal tone and behaviour rather than explicit text, relying on language alone is insufficient. Diagnostic accuracy also suffers without incorporating psychological expertise. To address these limitations, we present, to the best of our knowledge, the first application of LLMs to multimodal depression detection using the DAIC-WOZ dataset. We extract audio features using the pre-trained model Wav2Vec and map them to text-based LLMs for further processing. We also propose a novel strategy for incorporating psychological…

Peer Reviews

No public reviews on file for this paper yet.

Code & Models

Repositories

myxp-lyp/depression-detection
PyTorch · Official

Videos

No videos yet.

Taxonomy

Methods: Softmax · Attention Is All You Need · Balanced Selection · Sparse Evolutionary Training