Toxicity Detection for Indic Multilingual Social Media Content
Manan Jhaveri, Devanshu Ramaiya, Harveen Singh Chadha

TL;DR
This paper presents a multilingual toxicity detection system for Indic social media content, leveraging transformer models and data augmentation techniques, achieving top leaderboard performance in a shared challenge.
Contribution
The paper introduces an ensemble approach using XLM-RoBERTa and MuRIL for code-mixed toxicity classification, with enhancements from transliterated data and metadata, achieving state-of-the-art results.
Findings
Ensemble of XLM-RoBERTa and MuRIL achieved 0.9 F-1 score.
Adding transliterated data improved model performance.
Metadata and post-processing techniques further boosted results.
Abstract
Toxic content is one of the most critical issues for social media platforms today. India alone had 518 million social media users in 2020. In order to provide a good experience to content creators and their audience, it is crucial to flag toxic comments and the users who post that. But the big challenge is identifying toxicity in low resource Indic languages because of the presence of multiple representations of the same text. Moreover, the posts/comments on social media do not adhere to a particular format, grammar or sentence structure; this makes the task of abuse detection even more challenging for multilingual social media platforms. This paper describes the system proposed by team 'Moj Masti' using the data provided by ShareChat/Moj in \emph{IIIT-D Multilingual Abusive Comment Identification} challenge. We focus on how we can leverage multilingual transformer based pre-trained and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHate Speech and Cyberbullying Detection
