Leveraging Large Language Models for Suicide Detection on Social Media with Limited Labels
Vy Nguyen, Chau Pham

TL;DR
This paper introduces an ensemble approach using Large Language Models to automatically detect suicidal content on social media, achieving improved accuracy with limited labeled data.
Contribution
It proposes a novel pseudo-labeling method with LLM prompting and combines multiple models for enhanced suicide detection performance.
Findings
Ensemble model improves detection accuracy by 5 percentage points.
Achieves a weight F1 score of 0.770 on the public test set.
Larger LLMs provide better prompting accuracy.
Abstract
The increasing frequency of suicidal thoughts highlights the importance of early detection and intervention. Social media platforms, where users often share personal experiences and seek help, could be utilized to identify individuals at risk. However, the large volume of daily posts makes manual review impractical. This paper explores the use of Large Language Models (LLMs) to automatically detect suicidal content in text-based social media posts. We propose a novel method for generating pseudo-labels for unlabeled data by prompting LLMs, along with traditional classification fine-tuning techniques to enhance label accuracy. To create a strong suicide detection model, we develop an ensemble approach involving prompting with Qwen2-72B-Instruct, and using fine-tuned models such as Llama3-8B, Llama3.1-8B, and Gemma2-9B. We evaluate our approach on the dataset of the Suicide Ideation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMental Health via Writing
