Cyberbullying Detection in Hinglish Text Using MURIL and Explainable AI

Devesh Kumar

arXiv:2506.16066·cs.CL·June 23, 2025

Cyberbullying Detection in Hinglish Text Using MURIL and Explainable AI

Devesh Kumar

PDF

Open Access

TL;DR

This paper introduces a MURIL-based framework for detecting cyberbullying in Hinglish text, outperforming existing models and incorporating explainability, with insights into challenges like sarcasm and cultural nuances.

Contribution

The study develops a novel multilingual detection framework using MURIL architecture tailored for Hinglish, with explainability features and comprehensive evaluation across multiple datasets.

Findings

01

MURIL-based approach outperforms RoBERTa and IndicBERT in accuracy.

02

Achieves up to 94.63% accuracy on Mendeley dataset.

03

Identifies key challenges like sarcasm and cultural context in detection.

Abstract

The growth of digital communication platforms has led to increased cyberbullying incidents worldwide, creating a need for automated detection systems to protect users. The rise of code-mixed Hindi-English (Hinglish) communication on digital platforms poses challenges for existing cyberbullying detection systems, which were designed primarily for monolingual text. This paper presents a framework for cyberbullying detection in Hinglish text using the Multilingual Representations for Indian Languages (MURIL) architecture to address limitations in current approaches. Evaluation across six benchmark datasets -- Bohra \textit{et al.}, BullyExplain, BullySentemo, Kumar \textit{et al.}, HASOC 2021, and Mendeley Indo-HateSpeech -- shows that the MURIL-based approach outperforms existing multilingual models including RoBERTa and IndicBERT, with improvements of 1.36 to 13.07 percentage points and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHate Speech and Cyberbullying Detection · Authorship Attribution and Profiling · Sentiment Analysis and Opinion Mining