A Survey of Toxic Comment Classification Methods
Kehan Wang, Jiaxi Yang, Hongjun Wu

TL;DR
This survey reviews various machine learning methods for toxic comment classification, highlighting recent advances with CNN, LSTM, and Naive Bayes, and emphasizing improved accuracy through word embeddings.
Contribution
It provides a comprehensive comparison of CNN, LSTM, and Naive Bayes models for toxicity detection, demonstrating enhanced accuracy with word embedding techniques.
Findings
LSTM and CNN achieved very high accuracy in toxicity detection.
Word embeddings significantly improved model performance.
Compared models outperform traditional Naive Bayes solutions.
Abstract
While in real life everyone behaves themselves at least to some extent, it is much more difficult to expect people to behave themselves on the internet, because there are few checks or consequences for posting something toxic to others. Yet, for people on the other side, toxic texts often lead to serious psychological consequences. Detecting such toxic texts is challenging. In this paper, we attempt to build a toxicity detector using machine learning methods including CNN, Naive Bayes model, as well as LSTM. While there has been numerous groundwork laid by others, we aim to build models that provide higher accuracy than the predecessors. We produced very high accuracy models using LSTM and CNN, and compared them to the go-to solutions in language processing, the Naive Bayes model. A word embedding approach is also applied to empower the accuracy of our models.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHate Speech and Cyberbullying Detection
MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory
