Task Adaptive Pretraining of Transformers for Hostility Detection

Tathagata Raha; Sayar Ghosh Roy; Ujwal Narayan; Zubair Abid; Vasudeva; Varma

arXiv:2101.03382·cs.CL·January 12, 2021

Task Adaptive Pretraining of Transformers for Hostility Detection

Tathagata Raha, Sayar Ghosh Roy, Ujwal Narayan, Zubair Abid, Vasudeva, Varma

PDF

TL;DR

This paper demonstrates that Task Adaptive Pretraining (TAPT) significantly improves Transformer-based models for detecting hostility in social media, achieving top results in Hindi tweet classification tasks.

Contribution

It introduces the application of TAPT prior to fine-tuning Transformer models for hostility detection, showing performance gains in both coarse and fine-grained classification tasks.

Findings

01

TAPT enhances classification accuracy for hostility detection.

02

The system achieved 97.16% F1 score in coarse classification.

03

It ranked first in the shared task for Hindi hostility detection.

Abstract

Identifying adverse and hostile content on the web and more particularly, on social media, has become a problem of paramount interest in recent years. With their ever increasing popularity, fine-tuning of pretrained Transformer-based encoder models with a classifier head are gradually becoming the new baseline for natural language classification tasks. In our work, we explore the gains attributed to Task Adaptive Pretraining (TAPT) prior to fine-tuning of Transformer-based architectures. We specifically study two problems, namely, (a) Coarse binary classification of Hindi Tweets into Hostile or Not, and (b) Fine-grained multi-label classification of Tweets into four categories: hate, fake, offensive, and defamation. Building up on an architecture which takes emojis and segmented hashtags into consideration for classification, we are able to experimentally showcase the performance…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.