Untangling Hate Speech Definitions: A Semantic Componential Analysis Across Cultures and Domains

Katerina Korre; Arianna Muti; Federico Ruggeri; Alberto Barr\'on-Cede\~no

arXiv:2411.07417·cs.CL·May 22, 2025

Untangling Hate Speech Definitions: A Semantic Componential Analysis Across Cultures and Domains

Katerina Korre, Arianna Muti, Federico Ruggeri, Alberto Barr\'on-Cede\~no

PDF

Open Access 1 Video

TL;DR

This paper introduces a Semantic Componential Analysis framework to study how hate speech definitions vary across cultures and domains, revealing significant differences and the influence of definitions on detection models.

Contribution

It presents the first cross-cultural, cross-domain dataset of hate speech definitions and demonstrates how semantic components affect large language model responses in hate speech detection.

Findings

01

Definitions vary significantly across cultures and domains.

02

Many domains borrow definitions without cultural adaptation.

03

LLMs' hate speech detection responses are sensitive to the complexity of definitions.

Abstract

Hate speech relies heavily on cultural influences, leading to varying individual interpretations. For that reason, we propose a Semantic Componential Analysis (SCA) framework for a cross-cultural and cross-domain analysis of hate speech definitions. We create the first dataset of hate speech definitions encompassing 493 definitions from more than 100 cultures, drawn from five key domains: online dictionaries, academic research, Wikipedia, legal texts, and online platforms. By decomposing these definitions into semantic components, our analysis reveals significant variation across definitions, yet many domains borrow definitions from one another without taking into account the target culture. We conduct zero-shot model experiments using our proposed dataset, employing three popular open-sourced LLMs to understand the impact of different definitions on hate speech detection. Our findings…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Untangling Hate Speech Definitions: A Semantic Componential Analysis Across Cultures and Domains· underline

Taxonomy

TopicsHate Speech and Cyberbullying Detection