Measuring the Reliability of Hate Speech Annotations: The Case of the   European Refugee Crisis

Bj\"orn Ross; Michael Rist; Guillermo Carbonell; Benjamin Cabrera,; Nils Kurowsky; Michael Wojatzki

arXiv:1701.08118·cs.CL·January 30, 2017·107 cites

Measuring the Reliability of Hate Speech Annotations: The Case of the European Refugee Crisis

Bj\"orn Ross, Michael Rist, Guillermo Carbonell, Benjamin Cabrera,, Nils Kurowsky, Michael Wojatzki

PDF

Open Access 1 Repo 2 Models 1 Datasets

TL;DR

This study investigates the reliability of hate speech annotations related to the European Refugee Crisis, revealing low consistency and suggesting the need for more nuanced annotation methods and clearer guidelines.

Contribution

It provides empirical evidence that hate speech annotation reliability is low and that definitions alone do not significantly improve consistency among raters.

Findings

01

Showing definitions partially aligns raters' opinions.

02

Overall annotation reliability remains very low.

03

More detailed instructions are needed for better annotation consistency.

Abstract

Some users of social media are spreading racist, sexist, and otherwise hateful content. For the purpose of training a hate speech detection system, the reliability of the annotations is crucial, but there is no universally agreed-upon definition. We collected potentially hateful messages and asked two groups of internet users to determine whether they were hate speech or not, whether they should be banned or not and to rate their degree of offensiveness. One of the groups was shown a definition prior to completing the survey. We aimed to assess whether hate speech can be annotated reliably, and the extent to which existing definitions are in accordance with subjective ratings. Our results indicate that showing users a definition caused them to partially align their own opinion with the definition but did not improve reliability, which was very low overall. We conclude that the presence…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

UCSM-DUE/IWG_hatespeech_public
noneOfficial

Models

Datasets

manueltonneau/german-hate-speech-superset
dataset· 57 dl
57 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHate Speech and Cyberbullying Detection · Bullying, Victimization, and Aggression