RSD-15K: A Large-Scale User-Level Annotated Dataset for Suicide Risk Detection on Social Media
Shouwen Zheng, Yingzhi Tao, Taiqi Zhou

TL;DR
This paper introduces RSD-15K, a large-scale, user-level annotated social media dataset for suicide risk detection, supporting dynamic risk modeling and offering insights for mental health applications.
Contribution
It provides a comprehensive, time-sequenced dataset with rigorous annotations, enabling improved modeling of suicide risk evolution and benchmarking of various detection methods.
Findings
Deep learning models outperform traditional methods.
Large language models show promising results.
Dataset effectively supports automatic suicide risk assessment.
Abstract
In recent years, cognitive and mental health (CMH) disorders have increasingly become an important challenge for global public health, especially the suicide problem caused by multiple factors such as social competition, economic pressure and interpersonal relationships among young and middle-aged people. Social media, as an important platform for individuals to express emotions and seek help, provides the possibility for early detection and intervention of suicide risk. This paper introduces a large-scale dataset containing 15,000 user-level posts. Compared with existing datasets, this dataset retains complete user posting time sequence information, supports modeling the dynamic evolution of suicide risk, and we have also conducted comprehensive and rigorous annotations on these datasets. In the benchmark experiment, we systematically evaluated the performance of traditional machine…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMental Health via Writing
