Generative AI for Hate Speech Detection: Evaluation and Findings
Sagi Pendzel, Tomer Wullach, Amir Adler, Einat Minkov

TL;DR
This paper evaluates the use of generative AI to create synthetic hate speech data for training large language models, improving their ability to detect hate speech across diverse datasets.
Contribution
It provides a comprehensive review and empirical analysis of data augmentation with generative AI for hate speech detection, including comparisons with zero-shot GPT-3.5 performance.
Findings
Data augmentation improves recall across datasets.
GPT-3.5 achieves better generalization but lower precision.
Synthetic data helps models generalize better to unseen hate speech.
Abstract
Automatic hate speech detection using deep neural models is hampered by the scarcity of labeled datasets, leading to poor generalization. To mitigate this problem, generative AI has been utilized to generate large amounts of synthetic hate speech sequences from available labeled examples, leveraging the generated data in finetuning large pre-trained language models (LLMs). In this chapter, we provide a review of relevant methods, experimental setups and evaluation of this approach. In addition to general LLMs, such as BERT, RoBERTa and ALBERT, we apply and evaluate the impact of train set augmentation with generated data using LLMs that have been already adapted for hate detection, including RoBERTa-Toxicity, HateBERT, HateXplain, ToxDect, and ToxiGen. An empirical study corroborates our previous findings, showing that this approach improves hate speech generalization, boosting recall…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHate Speech and Cyberbullying Detection
MethodsSparse Evolutionary Training · Refunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · 15 Ways to Contact How can i speak to someone at Delta Airlines · Attention Is All You Need · Byte Pair Encoding · Dense Connections · Linear Layer · WordPiece · {Dispute@FaQ-s}How to file a dispute with Expedia?
