Developing Successful Shared Tasks on Offensive Language Identification for Dravidian Languages

Bharathi Raja Chakravarthi; Dhivya Chinnappa; Ruba Priyadharshini; Anand Kumar Madasamy; Sangeetha Sivanesan; Subalalitha Chinnaudayar Navaneethakrishnan; Sajeetha Thavareesan; Dhanalakshmi Vadivel; Rahul Ponnusamy; Prasanna Kumar Kumaresan

arXiv:2111.03375·cs.CL·November 8, 2021·1 cites

TL;DR

This paper introduces shared tasks for offensive language detection in under-resourced Dravidian languages, providing data, task definitions, and system evaluations to advance research in this area.

Contribution

The paper establishes evaluation frameworks and datasets for offensive language identification in Malayalam, Tamil, and Kannada, making it possible to compare different approaches.

Findings

01. Multiple systems participated in the evaluation.

02. The datasets enabled benchmarking of offensive language detection.

03. The paper discusses various methods used by participants.

Abstract

With the rapid growth of mobile computing and Web technologies, offensive language has become more prevalent on social networking platforms. Because identifying offensive language in local languages is essential for moderating social media content, this paper works with three under-resourced Dravidian languages: Malayalam, Tamil, and Kannada. We present an evaluation task held at FIRE 2020 (HASOC-DravidianCodeMix) and at DravidianLangTech at EACL 2021, designed to provide a framework for comparing different approaches to this problem. This paper describes the data creation, defines the task, lists the participating systems, and discusses the various methods used.

Taxonomy

Topics: Hate Speech and Cyberbullying Detection