A Taxonomy of Rater Disagreements: Surveying Challenges & Opportunities   from the Perspective of Annotating Online Toxicity

Wenbo Zhang; Hangzhi Guo; Ian D Kivlichan; Vinodkumar Prabhakaran,; Davis Yadav; Amulya Yadav

arXiv:2311.04345·cs.CL·November 9, 2023·1 cites

A Taxonomy of Rater Disagreements: Surveying Challenges & Opportunities from the Perspective of Annotating Online Toxicity

Wenbo Zhang, Hangzhi Guo, Ian D Kivlichan, Vinodkumar Prabhakaran,, Davis Yadav, Amulya Yadav

PDF

Open Access

TL;DR

This paper surveys the causes of rater disagreements in annotating online toxicity, proposing a detailed taxonomy and discussing solutions to improve dataset quality and toxicity detection models.

Contribution

It introduces a comprehensive taxonomy of rater disagreement causes specific to online toxicity annotation, filling a gap in understanding and guiding future research.

Findings

01

Identifies multiple root causes of rater disagreement.

02

Summarizes potential solutions for each cause.

03

Highlights open issues for future work.

Abstract

Toxicity is an increasingly common and severe issue in online spaces. Consequently, a rich line of machine learning research over the past decade has focused on computationally detecting and mitigating online toxicity. These efforts crucially rely on human-annotated datasets that identify toxic content of various kinds in social media texts. However, such annotations historically yield low inter-rater agreement, which was often dealt with by taking the majority vote or other such approaches to arrive at a single ground truth label. Recent research has pointed out the importance of accounting for the subjective nature of this task when building and utilizing these datasets, and this has triggered work on analyzing and better understanding rater disagreements, and how they could be effectively incorporated into the machine learning developmental pipeline. While these efforts are filling…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHate Speech and Cyberbullying Detection · Software Engineering Research

MethodsSparse Evolutionary Training