Gender Bias in Coreference Resolution: Evaluation and Debiasing Methods
Jieyu Zhao, Tianlu Wang, Mark Yatskar, Vicente Ordonez, Kai-Wei Chang

TL;DR
This paper introduces WinoBias, a benchmark for evaluating gender bias in coreference resolution systems, and proposes a debiasing method that reduces bias without harming overall performance.
Contribution
The paper presents WinoBias, a new dataset for gender bias evaluation, and combines data augmentation with embedding debiasing to mitigate bias in coreference models.
Findings
Coreference systems favor stereotypical gender-entity links by 21.1 F1 points.
Debiasing techniques reduce gender bias in models.
Performance on existing benchmarks remains unaffected by debiasing.
Abstract
We introduce a new benchmark, WinoBias, for coreference resolution focused on gender bias. Our corpus contains Winograd-schema style sentences with entities corresponding to people referred by their occupation (e.g. the nurse, the doctor, the carpenter). We demonstrate that a rule-based, a feature-rich, and a neural coreference system all link gendered pronouns to pro-stereotypical entities with higher accuracy than anti-stereotypical entities, by an average difference of 21.1 in F1 score. Finally, we demonstrate a data-augmentation approach that, in combination with existing word-embedding debiasing techniques, removes the bias demonstrated by these systems in WinoBias without significantly affecting their performance on existing coreference benchmark datasets. Our dataset and code are available at http://winobias.org.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗google/gemma-7bmodel· 30k dl· ♡ 329330k dl♡ 3293
- 🤗google/gemma-2-2b-itmodel· 368k dl· ♡ 1314368k dl♡ 1314
- 🤗google/gemma-2-2bmodel· 489k dl· ♡ 636489k dl♡ 636
- 🤗google/gemma-2bmodel· 174k dl· ♡ 1152174k dl♡ 1152
- 🤗google/gemma-2-27b-itmodel· 309k dl· ♡ 561309k dl♡ 561
- 🤗google/gemma-2-9b-itmodel· 254k dl· ♡ 781254k dl♡ 781
- 🤗ataeff/recurrentgemma-2b-itmodel· ♡ 1♡ 1
- 🤗fairnlp/bert-cdamodel· 3 dl3 dl
- 🤗fairnlp/albert-cdamodel· 6 dl6 dl
- 🤗google/gemma-2b-itmodel· 57k dl· ♡ 86257k dl♡ 862
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification
