Battling Hateful Content in Indic Languages HASOC '21

Aditya Kadam; Anmol Goel; Jivitesh Jain; Jushaan Singh Kalra; Mallika Subramanian; Manvith Reddy; Prashant Kodali; T.H. Arjun; Manish Shrivastava; Ponnurangam Kumaraguru

arXiv:2110.12780·cs.CL·November 8, 2021

TL;DR

This paper addresses hate speech detection in multilingual and code-mixed social media texts using multilingual transformer models. Its submissions ranked 3rd overall among the 6 teams that participated in all subtasks of the HASOC 2021 challenge.

Contribution

It describes a multilingual transformer-based architecture for all six subtasks of the HASOC 2021 Multilingual Twitter Hate-Speech Detection challenge.

Findings

01 Ranked 3rd overall among the 6 teams that participated in all subtasks

02 Effective handling of code-mixed and multilingual texts

03 Demonstrated the viability of transformer models for hate speech detection

Abstract

The extensive rise in the consumption of online social media (OSMs) by a large number of people makes curbing the spread of hateful content on these platforms a critical problem. With the growing usage of OSMs in multiple languages, the task of detecting and characterizing hate becomes more complex. The subtle variations of code-mixed texts, along with script switching, only add to the complexity. This paper presents a solution for the HASOC 2021 Multilingual Twitter Hate-Speech Detection challenge by team PreCog IIIT Hyderabad. We adopt a multilingual transformer-based approach and describe our architecture for all 6 subtasks of the challenge. Out of the 6 teams that participated in all the subtasks, our submissions rank 3rd overall.

Code & Models

Repositories

adi2k/precog-hasoc-2021 (Official)

Taxonomy

Topics: Hate Speech and Cyberbullying Detection · Internet Traffic Analysis and Secure E-voting · Advanced Malware Detection Techniques