A Multi-Stage Large Language Model Framework for Extracting Suicide-Related Social Determinants of Health

Song Wang; Yishu Wei; Haotian Ma; Max Lovitt; Kelly Deng; Yuan Meng; Zihan Xu; Jingze Zhang; Yunyu Xiao; Ying Ding; Xuhai Xu; Joydeep Ghosh; Yifan Peng

arXiv:2508.05003·cs.CL·August 8, 2025

A Multi-Stage Large Language Model Framework for Extracting Suicide-Related Social Determinants of Health

Song Wang, Yishu Wei, Haotian Ma, Max Lovitt, Kelly Deng, Yuan Meng, Zihan Xu, Jingze Zhang, Yunyu Xiao, Ying Ding, Xuhai Xu, Joydeep Ghosh, Yifan Peng

PDF

TL;DR

This paper introduces a multi-stage large language model framework that improves the extraction of suicide-related social determinants of health from unstructured text, enhancing accuracy, explainability, and efficiency for early intervention.

Contribution

The paper presents a novel multi-stage LLM framework that outperforms existing models in extracting SDoH factors related to suicide, with improved explainability and reduced inference costs.

Findings

01

Enhanced SDoH extraction accuracy and context retrieval.

02

Fine-tuning smaller models achieves comparable performance.

03

Intermediate explanations improve model transparency.

Abstract

Background: Understanding social determinants of health (SDoH) factors contributing to suicide incidents is crucial for early intervention and prevention. However, data-driven approaches to this goal face challenges such as long-tailed factor distributions, analyzing pivotal stressors preceding suicide incidents, and limited model explainability. Methods: We present a multi-stage large language model framework to enhance SDoH factor extraction from unstructured text. Our approach was compared to other state-of-the-art language models (i.e., pre-trained BioBERT and GPT-3.5-turbo) and reasoning models (i.e., DeepSeek-R1). We also evaluated how the model's explanations help people annotate SDoH factors more quickly and accurately. The analysis included both automated comparisons and a pilot user study. Results: We show that our proposed framework demonstrated performance boosts in the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.