Ceasing hate withMoH: Hate Speech Detection in Hindi-English Code-Switched Language
Arushi Sharma, Anubha Kabra, Minni Jain

TL;DR
This paper introduces MoH, a novel pipeline for hate speech detection in Hindi-English code-switched social media text, leveraging transliteration and multilingual models to improve detection accuracy significantly.
Contribution
The work presents MoH, a new method combining transliteration and fine-tuned multilingual models for effective hate speech detection in code-switched Hindi-English text.
Findings
MoH improves F1 scores by 13% with classical models.
MoH outperforms baseline models by 6%.
MoH achieves 15% higher performance with data simulations.
Abstract
Social media has become a bedrock for people to voice their opinions worldwide. Due to the greater sense of freedom with the anonymity feature, it is possible to disregard social etiquette online and attack others without facing severe consequences, inevitably propagating hate speech. The current measures to sift the online content and offset the hatred spread do not go far enough. One factor contributing to this is the prevalence of regional languages in social media and the paucity of language flexible hate speech detectors. The proposed work focuses on analyzing hate speech in Hindi-English code-switched language. Our method explores transformation techniques to capture precise text representation. To contain the structure of data and yet use it with existing algorithms, we developed MoH or Map Only Hindi, which means "Love" in Hindi. MoH pipeline consists of language identification,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHate Speech and Cyberbullying Detection · Internet Traffic Analysis and Secure E-voting
MethodsAttention Is All You Need · Linear Layer · Weight Decay · Softmax · Linear Warmup With Linear Decay · Residual Connection · WordPiece · Attention Dropout · Refunds@Expedia|||How do I get a full refund from Expedia? · Adam
