NLP-Based Review for Toxic Comment Detection Tailored to the Chinese Cyberspace
Ruixing Ren, Junhui Zhao, Xiaoke Sun, Qiuping Li

TL;DR
This paper reviews NLP techniques for detecting toxic comments in Chinese cyberspace, highlighting challenges due to language complexity and proposing a scalable framework for better classification and interpretability.
Contribution
It introduces a novel fine-grained, scalable framework for toxic comment detection tailored to Chinese language nuances, addressing dataset construction and model interpretability.
Findings
Analysis of Chinese toxic comment characteristics
Review of detection models from traditional to deep learning
Discussion of open challenges and future directions
Abstract
With the in-depth integration of mobile Internet and widespread adoption of social platforms, user-generated content in the Chinese cyberspace has witnessed explosive growth. Among this content, the proliferation of toxic comments poses severe challenges to individual mental health, community atmosphere and social trust. Owing to the strong context dependence, cultural specificity and rapid evolution of Chinese cyber language, toxic expressions are often conveyed through complex forms such as homophones and metaphors, imposing notable limitations on traditional detection methods. To address this issue, this review focuses on the core topic of natural language processing based toxic comment detection in the Chinese cyberspace, systematically collating and critically analyzing the research progress and key challenges in this field. This review first defines the connotation and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHate Speech and Cyberbullying Detection · Topic Modeling · Misinformation and Its Impacts
