Evolving Hate Speech Online: An Adaptive Framework for Detection and Mitigation
Shiza Ali, Jeremy Blackburn, Gianluca Stringhini

TL;DR
This paper introduces an adaptive hybrid model combining BERT and lexicon updates to improve hate speech detection online, effectively identifying emerging slurs and linguistic patterns to enhance safety.
Contribution
It presents a novel adaptive framework that updates lexicons using word embeddings and integrates them with BERT for improved hate speech detection.
Findings
Achieves 95% accuracy on multiple datasets
Effectively detects new and obfuscated hate speech
Proactively updates lexicons to adapt to emerging language
Abstract
The proliferation of social media platforms has led to an increase in the spread of hate speech, particularly targeting vulnerable communities. Unfortunately, existing methods for automatically identifying and blocking toxic language rely on pre-constructed lexicons, making them reactive rather than adaptive. As such, these approaches become less effective over time, especially when new communities are targeted with slurs not included in the original datasets. To address this issue, we present an adaptive approach that uses word embeddings to update lexicons and develop a hybrid model that adjusts to emerging slurs and new linguistic patterns. This approach can effectively detect toxic language, including intentional spelling mistakes employed by aggressors to avoid detection. Our hybrid model, which combines BERT with lexicon-based techniques, achieves an accuracy of 95% for most…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHate Speech and Cyberbullying Detection · Freedom of Expression and Defamation
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Adam · Softmax · Dropout · Weight Decay · WordPiece · Layer Normalization · Residual Connection · Linear Layer
