Dealing with Annotator Disagreement in Hate Speech Classification

Somaiyeh Dehghan; Mehmet Umut Sen; Berrin Yanikoglu

arXiv:2502.08266·cs.CL·August 19, 2025·2 cites

Dealing with Annotator Disagreement in Hate Speech Classification

Somaiyeh Dehghan, Mehmet Umut Sen, Berrin Yanikoglu

PDF

Open Access

TL;DR

This paper investigates strategies to handle annotator disagreement in hate speech classification, emphasizing the importance of addressing subjectivity in labeling and providing benchmark results for Turkish tweets.

Contribution

It evaluates automatic aggregation methods for multiple annotations in hate speech detection, offering new insights and state-of-the-art benchmarks for Turkish social media data.

Findings

01

Automatic aggregation improves label quality in hate speech datasets

02

Addressing annotator disagreement enhances model performance

03

Provides benchmark results for Turkish hate speech classification

Abstract

Hate speech detection is a crucial task, especially on social media, where harmful content can spread quickly. Implementing machine learning models to automatically identify and address hate speech is essential for mitigating its impact and preventing its proliferation. The first step in developing an effective hate speech detection model is to acquire a high-quality dataset for training. Labeled data is essential for most natural language processing tasks, but categorizing hate speech is difficult due to the diverse and often subjective nature of hate speech, which can lead to varying interpretations and disagreements among annotators. This paper examines strategies for addressing annotator disagreement, an issue that has been largely overlooked. In particular, we evaluate various automatic approaches for aggregating multiple annotations, in the context of hate speech classification in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHate Speech and Cyberbullying Detection