Depression Risk Assessment in Social Media via Large Language Models
Giorgia Gulino, Manuel Petrucci

TL;DR
This paper presents a scalable, LLM-based system for assessing depression risk in social media posts, demonstrating competitive performance and stable risk profiles across communities.
Contribution
It introduces a novel LLM-based method for depression risk assessment using multi-label emotion classification and severity indexing on Reddit data.
Findings
Best model achieves micro-F1 = 0.75 and macro-F1 = 0.70.
Method performs competitively with fine-tuned models like BART.
Risk profiles are consistent and stable over time across communities.
Abstract
Depression is one of the most prevalent and debilitating mental health conditions worldwide, frequently underdiagnosed and undertreated. The proliferation of social media platforms provides a rich source of naturalistic linguistic signals for the automated monitoring of psychological well-being. In this work, we propose a system based on Large Language Models (LLMs) for depression risk assessment in Reddit posts, through multi-label classification of eight depression-associated emotions and the computation of a weighted severity index. The method is evaluated in a zero-shot setting on the annotated DepressionEmo dataset (~6,000 posts) and applied in-the-wild to 469,692 comments collected from four subreddits over the period 2024-2025. Our best model, gemma3:27b, achieves micro-F1 = 0.75 and macro-F1 = 0.70, results competitive with purpose-built fine-tuned models (BART: micro-F1 = 0.80,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
