TL;DR
This paper investigates gender biases in coreference resolution systems, emphasizing the importance of recognizing gender complexity to prevent systemic harms, and introduces new datasets for bias analysis.
Contribution
It develops two novel datasets to analyze gender bias in coreference resolution and highlights the need for gender-inclusive systems based on sociolinguistic insights.
Findings
Bias exists in current coreference systems regarding gender.
Recognizing gender complexity reduces potential harms.
New datasets enable better bias interrogation.
Abstract
Correctly resolving textual mentions of people fundamentally entails making inferences about those people. Such inferences raise the risk of systemic biases in coreference resolution systems, including biases that can harm binary and non-binary trans and cis stakeholders. To better understand such biases, we foreground nuanced conceptualizations of gender from sociology and sociolinguistics, and develop two new datasets for interrogating bias in crowd annotations and in existing coreference resolution systems. Through these studies, conducted on English text, we confirm that without acknowledging and building systems that recognize the complexity of gender, we build systems that lead to many potential harms.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
