Understanding and Analyzing Inappropriately Targeting Language in Online Discourse: A Comparative Annotation Study

Baran Barbarestani; Isa Maks; Piek Vossen

arXiv:2505.16847·cs.CL·May 23, 2025

Understanding and Analyzing Inappropriately Targeting Language in Online Discourse: A Comparative Annotation Study

Baran Barbarestani, Isa Maks, Piek Vossen

PDF

Open Access

TL;DR

This study develops a comprehensive annotation framework combining human and AI inputs to detect inappropriately targeting language in online conversations, revealing the importance of context and uncovering new targeting categories.

Contribution

It introduces a novel multi-source annotation approach and provides a comparative analysis of human and AI methods for identifying nuanced hate speech.

Findings

01

Context significantly affects hate speech detection accuracy.

02

ChatGPT struggles with nuanced and subtle targeting language.

03

New categories like social belief and body image were identified.

Abstract

This paper introduces a method for detecting inappropriately targeting language in online conversations by integrating crowd and expert annotations with ChatGPT. We focus on English conversation threads from Reddit, examining comments that target individuals or groups. Our approach involves a comprehensive annotation framework that labels a diverse data set for various target categories and specific target words within the conversational context. We perform a comparative analysis of annotations from human experts, crowd annotators, and ChatGPT, revealing strengths and limitations of each method in recognizing both explicit hate speech and subtler discriminatory language. Our findings highlight the significant role of contextual factors in identifying hate speech and uncover new categories of targeting, such as social belief and body image. We also address the challenges and subjective…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHate Speech and Cyberbullying Detection · Spam and Phishing Detection · Misinformation and Its Impacts

MethodsFocus · Sparse Evolutionary Training