DepressLLM: Interpretable domain-adapted language model for depression detection from real-world narratives

Sehwan Moon; Aram Lee; Jeong Eun Kim; Hee-Ju Kang; Il-Seon Shin; Sung-Wan Kim; Jae-Min Kim; Min Jhon; Ju-Wan Kim

arXiv:2508.08591·cs.CL·August 13, 2025

DepressLLM: Interpretable domain-adapted language model for depression detection from real-world narratives

Sehwan Moon, Aram Lee, Jeong Eun Kim, Hee-Ju Kang, Il-Seon Shin, Sung-Wan Kim, Jae-Min Kim, Min Jhon, Ju-Wan Kim

PDF

TL;DR

DepressLLM is an interpretable language model trained on autobiographical narratives that improves depression detection accuracy and confidence estimation, demonstrating potential for early psychiatric diagnosis.

Contribution

Introduces DepressLLM with a novel dataset and the SToPS module for interpretable, high-performance depression prediction across diverse real-world data.

Findings

01

Achieved an AUC of 0.789, rising to 0.904 with high-confidence samples.

02

Validated robustness on multiple datasets including EMA and clinical interviews.

03

Identified key limitations through psychiatric review of misclassifications.

Abstract

Advances in large language models (LLMs) have enabled a wide range of applications. However, depression prediction is hindered by the lack of large-scale, high-quality, and rigorously annotated datasets. This study introduces DepressLLM, trained and evaluated on a novel corpus of 3,699 autobiographical narratives reflecting both happiness and distress. DepressLLM provides interpretable depression predictions and, via its Score-guided Token Probability Summation (SToPS) module, delivers both improved classification performance and reliable confidence estimates, achieving an AUC of 0.789, which rises to 0.904 on samples with confidence $\geq$ 0.95. To validate its robustness to heterogeneous data, we evaluated DepressLLM on in-house datasets, including an Ecological Momentary Assessment (EMA) corpus of daily stress and mood recordings, and on public clinical interview data. Finally, a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.