TL;DR
This paper explores the use of Convolutional Neural Networks (CNNs) for classifying toxic comments, demonstrating their superiority over traditional bag-of-words methods in a large Wikipedia dataset.
Contribution
It introduces CNN-based models for toxic comment detection and compares their performance against traditional methods, showing improved accuracy.
Findings
CNNs outperform bag-of-words in toxic comment classification
Deep learning approaches show promising results in text analytics
CNNs reinforce research interest in modern text classification methods
Abstract
Flood of information is produced in a daily basis through the global Internet usage arising from the on-line interactive communications among users. While this situation contributes significantly to the quality of human life, unfortunately it involves enormous dangers, since on-line texts with high toxicity can cause personal attacks, on-line harassment and bullying behaviors. This has triggered both industrial and research community in the last few years while there are several tries to identify an efficient model for on-line toxic comment prediction. However, these steps are still in their infancy and new approaches and frameworks are required. On parallel, the data explosion that appears constantly, makes the construction of new machine learning computational tools for managing this information, an imperative need. Thankfully advances in hardware, cloud computing and big data…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
