Gender Bias in Coreference Resolution: Evaluation and Debiasing Methods

Jieyu Zhao; Tianlu Wang; Mark Yatskar; Vicente Ordonez; Kai-Wei Chang

arXiv:1804.06876·cs.CL·April 20, 2018·95 cites

Gender Bias in Coreference Resolution: Evaluation and Debiasing Methods

Jieyu Zhao, Tianlu Wang, Mark Yatskar, Vicente Ordonez, Kai-Wei Chang

PDF

Open Access 4 Repos 10 Models 1 Datasets

TL;DR

This paper introduces WinoBias, a benchmark for evaluating gender bias in coreference resolution systems, and proposes a debiasing method that reduces bias without harming overall performance.

Contribution

The paper presents WinoBias, a new dataset for gender bias evaluation, and combines data augmentation with embedding debiasing to mitigate bias in coreference models.

Findings

01

Coreference systems favor stereotypical gender-entity links by 21.1 F1 points.

02

Debiasing techniques reduce gender bias in models.

03

Performance on existing benchmarks remains unaffected by debiasing.

Abstract

We introduce a new benchmark, WinoBias, for coreference resolution focused on gender bias. Our corpus contains Winograd-schema style sentences with entities corresponding to people referred by their occupation (e.g. the nurse, the doctor, the carpenter). We demonstrate that a rule-based, a feature-rich, and a neural coreference system all link gendered pronouns to pro-stereotypical entities with higher accuracy than anti-stereotypical entities, by an average difference of 21.1 in F1 score. Finally, we demonstrate a data-augmentation approach that, in combination with existing word-embedding debiasing techniques, removes the bias demonstrated by these systems in WinoBias without significantly affecting their performance on existing coreference benchmark datasets. Our dataset and code are available at http://winobias.org.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Models

Datasets

uclanlp/wino_bias
dataset· 2.8k dl
2.8k dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification