SOS-1K: A Fine-grained Suicide Risk Classification Dataset for Chinese   Social Media Analysis

Hongzhi Qi; Hanfei Liu; Jianqiang Li; Qing Zhao; Wei Zhai; Dan Luo,; Tian Yu He; Shuo Liu; Bing Xiang Yang; Guanghui Fu

arXiv:2404.12659·cs.CL·April 22, 2024·1 cites

SOS-1K: A Fine-grained Suicide Risk Classification Dataset for Chinese Social Media Analysis

Hongzhi Qi, Hanfei Liu, Jianqiang Li, Qing Zhao, Wei Zhai, Dan Luo,, Tian Yu He, Shuo Liu, Bing Xiang Yang, Guanghui Fu

PDF

Open Access 1 Repo

TL;DR

This paper introduces SOS-1K, a Chinese social media dataset for fine-grained suicide risk classification, evaluates multiple models, and explores data augmentation techniques to improve detection accuracy.

Contribution

It provides the first Chinese dataset for detailed suicide risk levels and demonstrates the effectiveness of domain-specific pre-trained models and data augmentation methods.

Findings

01

Deep learning models achieve 88.39% F1 in high/low risk classification.

02

Fine-grained classification results are modest, with 50.89% F1.

03

Data augmentation improves model performance by up to 4.65%.

Abstract

In the social media, users frequently express personal emotions, a subset of which may indicate potential suicidal tendencies. The implicit and varied forms of expression in internet language complicate accurate and rapid identification of suicidal intent on social media, thus creating challenges for timely intervention efforts. The development of deep learning models for suicide risk detection is a promising solution, but there is a notable lack of relevant datasets, especially in the Chinese context. To address this gap, this study presents a Chinese social media dataset designed for fine-grained suicide risk classification, focusing on indicators such as expressions of suicide intent, methods of suicide, and urgency of timing. Seven pre-trained models were evaluated in two tasks: high and low suicide risk, and fine-grained suicide risk classification on a level of 0 to 10. In our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

hongzhiq/finegrainedsuicidedetection
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMental Health via Writing · Computational and Text Analysis Methods