Improved two-stage hate speech classification for twitter based on Deep Neural Networks
Georgios K. Pitsilis

TL;DR
This paper introduces a two-stage deep neural network approach using enhanced LSTM classifiers for improved hate speech detection on Twitter, demonstrating superior performance over existing methods.
Contribution
It presents a novel two-stage RNN-based scheme for hate speech classification, improving accuracy in detecting specific forms of online harassment.
Findings
Outperforms current state-of-the-art in hate speech detection
Effective in identifying racism and sexism in social media posts
Validated on multiple datasets with superior results
Abstract
Hate speech is a form of online harassment that involves the use of abusive language, and it is commonly seen in social media posts. This sort of harassment mainly focuses on specific group characteristics such as religion, gender, ethnicity, etc and it has both societal and economic consequences nowadays. The automatic detection of abusive language in text postings has always been a difficult task, but it is lately receiving much interest from the scientific community. This paper addresses the important problem of discerning hateful content in social media. The model we propose in this work is an extension of an existing approach based on LSTM neural network architectures, which we appropriately enhanced and fine-tuned to detect certain forms of hatred language, such as racism or sexism, in a short text. The most significant enhancement is the conversion to a two-stage scheme…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHate Speech and Cyberbullying Detection · Internet Traffic Analysis and Secure E-voting
MethodsTanh Activation · Sigmoid Activation · Long Short-Term Memory
