Accurate and Data-Efficient Toxicity Prediction when Annotators Disagree

Harbani Jaggi; Kashyap Murali; Eve Fleisig; Erdem B{\i}y{\i}k

arXiv:2410.12217·cs.CL·October 17, 2024

Accurate and Data-Efficient Toxicity Prediction when Annotators Disagree

Harbani Jaggi, Kashyap Murali, Eve Fleisig, Erdem B{\i}y{\i}k

PDF

Open Access

TL;DR

This paper introduces new methods for predicting individual annotator ratings in toxicity detection, especially when annotators disagree, by leveraging annotator-specific data and demographics, improving prediction accuracy.

Contribution

The paper presents three novel approaches for modeling individual annotator ratings, highlighting the effectiveness of embedding-based architectures and survey-derived demographics in subjective NLP tasks.

Findings

01

Embedding-based architecture outperforms other methods.

02

Demographics from survey data are nearly as effective as true demographics.

03

Integrating annotator history and demographics improves rating prediction accuracy.

Abstract

When annotators disagree, predicting the labels given by individual annotators can capture nuances overlooked by traditional label aggregation. We introduce three approaches to predicting individual annotator ratings on the toxicity of text by incorporating individual annotator-specific information: a neural collaborative filtering (NCF) approach, an in-context learning (ICL) approach, and an intermediate embedding-based architecture. We also study the utility of demographic information for rating prediction. NCF showed limited utility; however, integrating annotator history, demographics, and survey information permits both the embedding-based architecture and ICL to substantially improve prediction accuracy, with the embedding-based architecture outperforming the other methods. We also find that, if demographics are predicted from survey information, using these imputed demographics…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Text Analysis Techniques · Software Engineering Research · Mobile Crowdsensing and Crowdsourcing