A Multi-Task Benchmark for Abusive Language Detection in Low-Resource Settings

Fitsum Gaim; Hoyun Song; Huije Lee; Changgeon Ko; Eui Jun Hwang; Jong C. Park

arXiv:2505.12116·cs.CL·October 28, 2025

A Multi-Task Benchmark for Abusive Language Detection in Low-Resource Settings

Fitsum Gaim, Hoyun Song, Huije Lee, Changgeon Ko, Eui Jun Hwang, Jong C. Park

PDF

Open Access 1 Repo 2 Models 1 Datasets

TL;DR

This paper introduces a large, annotated multi-task benchmark dataset for abusive language detection in Tigrinya social media, addressing resource scarcity and language diversity challenges in content moderation.

Contribution

It provides the first comprehensive dataset with joint annotations for abusiveness, sentiment, and topic classification in Tigrinya, including Romanized and native scripts, and establishes strong baseline models.

Findings

01

Small fine-tuned models outperform large language models in low-resource settings.

02

Achieved 86.67% F1 in abusiveness detection, surpassing LLMs by 7+ points.

03

Dataset covers 13,717 comments from 7,373 videos across 51 channels.

Abstract

Content moderation research has recently made significant advances, but remains limited in serving the majority of the world's languages due to the lack of resources, leaving millions of vulnerable users to online hostility. This work presents a large-scale human-annotated multi-task benchmark dataset for abusive language detection in Tigrinya social media with joint annotations for three tasks: abusiveness, sentiment, and topic classification. The dataset comprises 13,717 YouTube comments annotated by nine native speakers, collected from 7,373 videos with a total of over 1.2 billion views across 51 channels. We developed an iterative term clustering approach for effective data selection. Recognizing that around 64% of Tigrinya social media content uses Romanized transliterations rather than native Ge'ez script, our dataset accommodates both writing systems to reflect actual language…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

fgaim/tigrinya-abusive-language-detection
noneOfficial

Models

Datasets

fgaim/tigrinya-abusive-language-detection
dataset· 271 dl
271 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHate Speech and Cyberbullying Detection · Authorship Attribution and Profiling · Topic Modeling