Uncovering Intervention Opportunities for Suicide Prevention with Language Model Assistants
Jaspreet Ranjit, Hyundong J. Cho, Claire J. Smerdon, Yoonsoo Nam, Myles Phung, Jonathan May, John R. Blosnich, Swabha Swayamdipta

TL;DR
This study explores how language models can assist in suicide prevention research by improving data annotation efficiency and accuracy in sensitive public health datasets, potentially leading to better intervention strategies.
Contribution
It introduces a human-in-the-loop algorithm leveraging language models to streamline and enhance the annotation process for suicide-related data variables.
Findings
LM predictions match 85% of existing annotations
LM assists reveal 38% of annotation discrepancies
Algorithm achieves comparable quality with manual annotation
Abstract
Warning: This paper discusses topics of suicide and suicidal ideation, which may be distressing to some readers. The National Violent Death Reporting System (NVDRS) documents information about suicides in the United States, including free text narratives (e.g., circumstances surrounding a suicide). In a demanding public health data pipeline, annotators manually extract structured information from death investigation records following extensive guidelines developed painstakingly by experts. In this work, we facilitate data-driven insights from the NVDRS data to support the development of novel suicide interventions by investigating the value of language models (LMs) as efficient assistants to these (a) data annotators and (b) experts. We find that LM predictions match existing data annotations about 85% of the time across 50 NVDRS variables. In the cases where the LM disagrees with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMental Health via Writing · Suicide and Self-Harm Studies · Machine Learning in Healthcare
