Cyberbullying Detection in Hinglish Text Using MURIL and Explainable AI
Devesh Kumar

TL;DR
This paper introduces a MURIL-based framework for detecting cyberbullying in Hinglish text, outperforming existing models and incorporating explainability, with insights into challenges like sarcasm and cultural nuances.
Contribution
The study develops a novel multilingual detection framework using MURIL architecture tailored for Hinglish, with explainability features and comprehensive evaluation across multiple datasets.
Findings
MURIL-based approach outperforms RoBERTa and IndicBERT in accuracy.
Achieves up to 94.63% accuracy on Mendeley dataset.
Identifies key challenges like sarcasm and cultural context in detection.
Abstract
The growth of digital communication platforms has led to increased cyberbullying incidents worldwide, creating a need for automated detection systems to protect users. The rise of code-mixed Hindi-English (Hinglish) communication on digital platforms poses challenges for existing cyberbullying detection systems, which were designed primarily for monolingual text. This paper presents a framework for cyberbullying detection in Hinglish text using the Multilingual Representations for Indian Languages (MURIL) architecture to address limitations in current approaches. Evaluation across six benchmark datasets -- Bohra \textit{et al.}, BullyExplain, BullySentemo, Kumar \textit{et al.}, HASOC 2021, and Mendeley Indo-HateSpeech -- shows that the MURIL-based approach outperforms existing multilingual models including RoBERTa and IndicBERT, with improvements of 1.36 to 13.07 percentage points and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHate Speech and Cyberbullying Detection · Authorship Attribution and Profiling · Sentiment Analysis and Opinion Mining
