A Survey of Toxic Comment Classification Methods

Kehan Wang; Jiaxi Yang; Hongjun Wu

arXiv:2112.06412·cs.CL·December 14, 2021

Open Access

TL;DR

This survey reviews various machine learning methods for toxic comment classification, highlighting recent advances with CNN, LSTM, and Naive Bayes, and emphasizing improved accuracy through word embeddings.

Contribution

It provides a comprehensive comparison of CNN, LSTM, and Naive Bayes models for toxicity detection, demonstrating enhanced accuracy with word embedding techniques.

Findings

01 LSTM and CNN models achieved very high accuracy in toxicity detection.

02 Word embeddings significantly improved model performance.

03 The LSTM and CNN models outperform the traditional Naive Bayes baseline.

Abstract

While most people behave themselves in real life, at least to some extent, it is much harder to expect the same on the internet, because there are few checks or consequences for posting something toxic to others. Yet for the people on the receiving end, toxic texts often lead to serious psychological consequences. Detecting such toxic texts is challenging. In this paper, we attempt to build a toxicity detector using machine learning methods including CNN, LSTM, and Naive Bayes models. While others have laid considerable groundwork, we aim to build models that provide higher accuracy than their predecessors. We produced very high accuracy models using LSTM and CNN, and compared them to the go-to solution in language processing, the Naive Bayes model. A word embedding approach is also applied to improve the accuracy of our models.
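To make the Naive Bayes baseline mentioned in the abstract concrete, here is a minimal sketch of a multinomial Naive Bayes text classifier with Laplace smoothing. It is not the authors' implementation: the class name `NaiveBayes`, the whitespace `tokenize` helper, the smoothing value `alpha=1.0`, and the four toy training comments are all assumptions made up for illustration.

```python
import math
from collections import Counter, defaultdict


def tokenize(text):
    # Hypothetical tokenizer: lowercase and split on whitespace.
    return text.lower().split()


class NaiveBayes:
    """Multinomial Naive Bayes with Laplace (add-alpha) smoothing."""

    def __init__(self, alpha=1.0):
        self.alpha = alpha

    def fit(self, texts, labels):
        self.class_counts = Counter(labels)      # documents per class
        self.word_counts = defaultdict(Counter)  # token counts per class
        self.vocab = set()
        for text, label in zip(texts, labels):
            tokens = tokenize(text)
            self.word_counts[label].update(tokens)
            self.vocab.update(tokens)
        return self

    def log_scores(self, text):
        # Unnormalized log posterior: log P(class) + sum of log P(token | class).
        total_docs = sum(self.class_counts.values())
        tokens = [t for t in tokenize(text) if t in self.vocab]  # skip unseen tokens
        scores = {}
        for label, n_docs in self.class_counts.items():
            score = math.log(n_docs / total_docs)
            total_words = sum(self.word_counts[label].values())
            denom = total_words + self.alpha * len(self.vocab)
            for tok in tokens:
                score += math.log((self.word_counts[label][tok] + self.alpha) / denom)
            scores[label] = score
        return scores

    def predict(self, text):
        scores = self.log_scores(text)
        return max(scores, key=scores.get)


# Toy data invented for this example only.
train_texts = [
    "you are an idiot",
    "what a stupid idea you idiot",
    "thanks for the helpful answer",
    "great idea thanks a lot",
]
train_labels = ["toxic", "toxic", "clean", "clean"]

model = NaiveBayes().fit(train_texts, train_labels)
prediction = model.predict("you stupid idiot")
```

The classifier treats each comment as a bag of words, which is why it serves as a simple point of comparison: unlike an LSTM or CNN over word embeddings, it ignores word order and any similarity between distinct tokens.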

Taxonomy

Topics: Hate Speech and Cyberbullying Detection

Methods: Sigmoid Activation · Tanh Activation · Long Short-Term Memory