Harmful Suicide Content Detection

Kyumin Park; Myung Jae Baik; YeongJun Hwang; Yen Shin; HoJae Lee; Ruda; Lee; Sang Min Lee; Je Young Hannah Sun; Ah Rah Lee; Si Yeun Yoon; Dong-ho; Lee; Jihyung Moon; JinYeong Bak; Kyunghyun Cho; Jong-Woo Paik; Sungjoon; Park

arXiv:2407.13942·cs.CY·July 22, 2024

Harmful Suicide Content Detection

Kyumin Park, Myung Jae Baik, YeongJun Hwang, Yen Shin, HoJae Lee, Ruda, Lee, Sang Min Lee, Je Young Hannah Sun, Ah Rah Lee, Si Yeun Yoon, Dong-ho, Lee, Jihyung Moon, JinYeong Bak, Kyunghyun Cho, Jong-Woo Paik, Sungjoon, Park

PDF

Open Access

TL;DR

This paper introduces a new multi-modal benchmark and detection task for classifying harmful online suicide content into five levels, utilizing large language models and expert annotations to improve moderation and safety.

Contribution

It proposes a novel harmful suicide content detection task, develops a Korean multi-modal benchmark with expert labels, and explores LLM-based strategies for content moderation.

Findings

01

Developed a multi-modal Korean benchmark with expert annotations.

02

Explored LLM strategies for detecting harmful content.

03

Publicized the benchmark with ethical verification.

Abstract

Harmful suicide content on the Internet is a significant risk factor inducing suicidal thoughts and behaviors among vulnerable populations. Despite global efforts, existing resources are insufficient, specifically in high-risk regions like the Republic of Korea. Current research mainly focuses on understanding negative effects of such content or suicide risk in individuals, rather than on automatically detecting the harmfulness of content. To fill this gap, we introduce a harmful suicide content detection task for classifying online suicide content into five harmfulness levels. We develop a multi-modal benchmark and a task description document in collaboration with medical professionals, and leverage large language models (LLMs) to explore efficient methods for moderating such content. Our contributions include proposing a novel detection task, a multi-modal Korean benchmark with expert…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHate Speech and Cyberbullying Detection · Mental Health via Writing · Spam and Phishing Detection