Annotators with Attitudes: How Annotator Beliefs And Identities Bias Toxic Language Detection
Maarten Sap, Swabha Swayamdipta, Laura Vianna, Xuhui Zhou, Yejin Choi,, Noah A. Smith

TL;DR
This study explores how annotator identities and beliefs influence toxicity ratings in language, revealing biases that impact dataset quality and suggesting the need for context-aware annotation practices.
Contribution
It provides empirical evidence linking annotator social identities and beliefs to toxicity judgments, highlighting biases in toxic language datasets and detection systems.
Findings
Conservative annotators less likely to label anti-Black language as toxic.
Annotators with racist beliefs more likely to label AAE as toxic.
Toxicity detection systems reflect specific social perspectives.
Abstract
The perceived toxicity of language can vary based on someone's identity and beliefs, but this variation is often ignored when collecting toxic language datasets, resulting in dataset and model biases. We seek to understand the who, why, and what behind biases in toxicity annotations. In two online studies with demographically and politically diverse participants, we investigate the effect of annotator identities (who) and beliefs (why), drawing from social psychology research about hate speech, free speech, racist beliefs, political leaning, and more. We disentangle what is annotated as toxic by considering posts with three characteristics: anti-Black language, African American English (AAE) dialect, and vulgarity. Our results show strong associations between annotator identity and beliefs and their ratings of toxicity. Notably, more conservative annotators and those who scored highly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHate Speech and Cyberbullying Detection · Discourse Analysis in Language Studies · Social Media and Politics
Methods7 Fastest Ways to Call American Airlines Reservations Number (USA Guide)
