Collecting a Large-Scale Gender Bias Dataset for Coreference Resolution   and Machine Translation

Shahar Levy; Koren Lazar; Gabriel Stanovsky

arXiv:2109.03858·cs.CL·September 13, 2021

Collecting a Large-Scale Gender Bias Dataset for Coreference Resolution and Machine Translation

Shahar Levy, Koren Lazar, Gabriel Stanovsky

PDF

Open Access 1 Repo

TL;DR

This paper introduces a large-scale, real-world English dataset of 108K sentences to evaluate gender bias in coreference resolution and machine translation, revealing models' reliance on stereotypes and showing potential for bias mitigation.

Contribution

It provides the first large-scale, real-world gender bias dataset for coreference and translation models, enabling more realistic bias evaluation and mitigation research.

Findings

01

Models tend to over-rely on gender stereotypes in natural inputs.

02

The dataset can be used to fine-tune models to reduce bias.

03

The dataset is publicly available for further research.

Abstract

Recent works have found evidence of gender bias in models of machine translation and coreference resolution using mostly synthetic diagnostic datasets. While these quantify bias in a controlled experiment, they often do so on a small scale and consist mostly of artificial, out-of-distribution sentences. In this work, we find grammatical patterns indicating stereotypical and non-stereotypical gender-role assignments (e.g., female nurses versus male dancers) in corpora from three domains, resulting in a first large-scale gender bias dataset of 108K diverse real-world English sentences. We manually verify the quality of our corpus and use it to evaluate gender bias in various coreference resolution and machine translation models. We find that all tested models tend to over-rely on gender stereotypes when presented with natural inputs, which may be especially harmful when deployed in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

slab-nlp/bug
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Text Readability and Simplification · Topic Modeling