When the Majority is Wrong: Modeling Annotator Disagreement for   Subjective Tasks

Eve Fleisig; Rediet Abebe; Dan Klein

arXiv:2305.06626·cs.CL·March 19, 2024·1 cites

When the Majority is Wrong: Modeling Annotator Disagreement for Subjective Tasks

Eve Fleisig, Rediet Abebe, Dan Klein

PDF

Open Access

TL;DR

This paper introduces a model for understanding annotator disagreement in subjective tasks like hate speech detection, emphasizing the importance of demographic context and opinion modeling to improve prediction accuracy and interpretability.

Contribution

It presents a novel approach to predict individual annotator ratings and target group opinions without linking to personal identifiers, enhancing privacy and accuracy.

Findings

01

Model improves annotator rating prediction by 22%.

02

Model increases variance prediction accuracy by 33%.

03

Using demographic info and online opinions suffices for accurate predictions.

Abstract

Though majority vote among annotators is typically used for ground truth labels in natural language processing, annotator disagreement in tasks such as hate speech detection may reflect differences in opinion across groups, not noise. Thus, a crucial problem in hate speech detection is determining whether a statement is offensive to the demographic group that it targets, when that group may constitute a small fraction of the annotator pool. We construct a model that predicts individual annotator ratings on potentially offensive text and combines this information with the predicted target group of the text to model the opinions of target group members. We show gains across a range of metrics, including raising performance over the baseline by 22% at predicting individual annotators' ratings and by 33% at predicting variance among annotators, which provides a metric for model uncertainty…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHate Speech and Cyberbullying Detection · Internet Traffic Analysis and Secure E-voting · Social Media and Politics