STATE ToxiCN: A Benchmark for Span-level Target-Aware Toxicity Extraction in Chinese Hate Speech Detection

Zewen Bai; Shengdi Yin; Junyu Lu; Jingjie Zeng; Haohao Zhu; Yuanyuan Sun; Liang Yang; Hongfei Lin

arXiv:2501.15451·cs.CL·May 21, 2025

STATE ToxiCN: A Benchmark for Span-level Target-Aware Toxicity Extraction in Chinese Hate Speech Detection

Zewen Bai, Shengdi Yin, Junyu Lu, Jingjie Zeng, Haohao Zhu, Yuanyuan Sun, Liang Yang, Hongfei Lin

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces STATE ToxiCN, the first span-level Chinese hate speech dataset, and evaluates models on fine-grained detection, including hateful slang, advancing Chinese hate speech research.

Contribution

It constructs the first span-level Chinese hate speech dataset and assesses model performance, including on hateful slang detection, filling a significant research gap.

Findings

01

STATE ToxiCN enables detailed hate speech analysis.

02

Existing models show limited effectiveness on Chinese hate speech.

03

LLMs' ability to detect hateful slang varies significantly.

Abstract

The proliferation of hate speech has caused significant harm to society. The intensity and directionality of hate are closely tied to the target and argument it is associated with. However, research on hate speech detection in Chinese has lagged behind, and existing datasets lack span-level fine-grained annotations. Furthermore, the lack of research on Chinese hateful slang poses a significant challenge. In this paper, we provide a solution for fine-grained detection of Chinese hate speech. First, we construct a dataset containing Target-Argument-Hateful-Group quadruples (STATE ToxiCN), which is the first span-level Chinese hate speech dataset. Secondly, we evaluate the span-level hate speech detection performance of existing models using STATE ToxiCN. Finally, we conduct the first study on Chinese hateful slang and evaluate the ability of LLMs to detect such expressions. Our work…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

shenmeyemeifashengguo/STATE-ToxiCN
noneOfficial

Videos

STATE ToxiCN: A Benchmark for Span-level Target-Aware Toxicity Extraction in Chinese Hate Speech Detection· underline

Taxonomy

TopicsHate Speech and Cyberbullying Detection