Human and LLM Biases in Hate Speech Annotations: A Socio-Demographic Analysis of Annotators and Targets

Tommaso Giorgi; Lorenzo Cima; Tiziano Fagni; Marco Avvenuti; Stefano Cresci

arXiv:2410.07991·cs.CL·June 13, 2025

Human and LLM Biases in Hate Speech Annotations: A Socio-Demographic Analysis of Annotators and Targets

Tommaso Giorgi, Lorenzo Cima, Tiziano Fagni, Marco Avvenuti, Stefano Cresci

PDF

TL;DR

This paper investigates socio-demographic biases in human and LLM hate speech annotations, revealing significant differences and providing insights for improving AI-based detection systems.

Contribution

It introduces a comprehensive socio-demographic analysis of biases in hate speech annotations from both humans and LLMs, filling a gap in understanding bias sources.

Findings

01

Widespread biases in human annotations linked to socio-demographic factors

02

Significant differences between human and LLM biases

03

Insights for designing less biased hate speech detection systems

Abstract

The rise of online platforms exacerbated the spread of hate speech, demanding scalable and effective detection. However, the accuracy of hate speech detection systems heavily relies on human-labeled data, which is inherently susceptible to biases. While previous work has examined the issue, the interplay between the characteristics of the annotator and those of the target of the hate are still unexplored. We fill this gap by leveraging an extensive dataset with rich socio-demographic information of both annotators and targets, uncovering how human biases manifest in relation to the target's attributes. Our analysis surfaces the presence of widespread biases, which we quantitatively describe and characterize based on their intensity and prevalence, revealing marked differences. Furthermore, we compare human biases with those exhibited by persona-based LLMs. Our findings indicate that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.