ProvocationProbe: Instigating Hate Speech Dataset from Twitter

Abhay Kumar; Vigneshwaran Shankaran; Rajesh Sharma

arXiv:2410.19687·cs.CL·October 28, 2024

ProvocationProbe: Instigating Hate Speech Dataset from Twitter

Abhay Kumar, Vigneshwaran Shankaran, Rajesh Sharma

PDF

Open Access

TL;DR

This paper introduces ProvocationProbe, a new Twitter dataset with around twenty thousand tweets, aimed at distinguishing instigating hate speech from general hate speech through detailed annotations and feature analysis.

Contribution

The paper presents a novel dataset and analysis that differentiate instigating hate speech from general hate speech, focusing on identifying unique features and providing comprehensive annotations.

Findings

01

Identified key features distinguishing instigating hate speech from general hate speech.

02

Created a dataset covering nine global controversies with detailed annotations.

03

Highlighted the importance of targeted identity attacks and reasons for hate.

Abstract

In the recent years online social media platforms has been flooded with hateful remarks such as racism, sexism, homophobia etc. As a result, there have been many measures taken by various social media platforms to mitigate the spread of hate-speech over the internet. One particular concept within the domain of hate speech is instigating hate, which involves provoking hatred against a particular community, race, colour, gender, religion or ethnicity. In this work, we introduce \textit{ProvocationProbe} - a dataset designed to explore what distinguishes instigating hate speech from general hate speech. For this study, we collected around twenty thousand tweets from Twitter, encompassing a total of nine global controversies. These controversies span various themes including racism, politics, and religion. In this paper, i) we present an annotated dataset after comprehensive examination of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHate Speech and Cyberbullying Detection