Measuring the Reliability of Hate Speech Annotations: The Case of the European Refugee Crisis
Bj\"orn Ross, Michael Rist, Guillermo Carbonell, Benjamin Cabrera,, Nils Kurowsky, Michael Wojatzki

TL;DR
This study investigates the reliability of hate speech annotations related to the European Refugee Crisis, revealing low consistency and suggesting the need for more nuanced annotation methods and clearer guidelines.
Contribution
It provides empirical evidence that hate speech annotation reliability is low and that definitions alone do not significantly improve consistency among raters.
Findings
Showing definitions partially aligns raters' opinions.
Overall annotation reliability remains very low.
More detailed instructions are needed for better annotation consistency.
Abstract
Some users of social media are spreading racist, sexist, and otherwise hateful content. For the purpose of training a hate speech detection system, the reliability of the annotations is crucial, but there is no universally agreed-upon definition. We collected potentially hateful messages and asked two groups of internet users to determine whether they were hate speech or not, whether they should be banned or not and to rate their degree of offensiveness. One of the groups was shown a definition prior to completing the survey. We aimed to assess whether hate speech can be annotated reliably, and the extent to which existing definitions are in accordance with subjective ratings. Our results indicate that showing users a definition caused them to partially align their own opinion with the definition but did not improve reliability, which was very low overall. We conclude that the presence…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHate Speech and Cyberbullying Detection · Bullying, Victimization, and Aggression
