A Federated Approach to Few-Shot Hate Speech Detection for Marginalized Communities
Haotian Ye, Axel Wisiorek, Antonis Maronikolakis, \"Ozge Ala\c{c}am,, Hinrich Sch\"utze

TL;DR
This paper introduces REACT, a collection of hate speech datasets for marginalized communities in low-resource languages, and proposes a federated learning approach for privacy-preserving, few-shot hate speech detection that adapts to specific groups.
Contribution
It provides the first culture-specific hate speech datasets for marginalized groups and develops a federated, few-shot detection method with personalized models for improved accuracy.
Findings
Federated learning shows robustness across diverse target groups.
Personalized models outperform generic models in hate speech detection.
REACT datasets enable research in low-resource, culture-specific hate speech detection.
Abstract
Hate speech online remains an understudied issue for marginalized communities, particularly in the Global South, which includes developing societies with increasing internet penetration. In this paper, we aim to provide marginalized communities in societies where the dominant language is low-resource with a privacy-preserving tool to protect themselves from online hate speech by filtering offensive content in their native languages. Our contributions are twofold: 1) we release REACT (REsponsive hate speech datasets Across ConTexts), a collection of high-quality, culture-specific hate speech detection datasets comprising multiple target groups and low-resource languages, curated by experienced data collectors; 2) we propose a few-shot hate speech detection approach based on federated learning (FL), a privacy-preserving method for collaboratively training a central model that exhibits…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsHate Speech and Cyberbullying Detection
