Type B Reflexivization as an Unambiguous Testbed for Multilingual   Multi-Task Gender Bias

Ana Valeria Gonzalez; Maria Barrett; Rasmus Hvingelby; Kellie Webster,; Anders S{\o}gaard

arXiv:2009.11982·cs.CL·September 29, 2020

Type B Reflexivization as an Unambiguous Testbed for Multilingual Multi-Task Gender Bias

Ana Valeria Gonzalez, Maria Barrett, Rasmus Hvingelby, Kellie Webster,, Anders S{\o}gaard

PDF

2 Repos

TL;DR

This paper introduces a multilingual, multi-task challenge dataset based on type B reflexivization in Swedish and Russian to detect gender bias in NLP models, revealing biases across languages and tasks.

Contribution

It presents a novel challenge dataset leveraging reflexivization in multiple languages to unambiguously detect gender bias in NLP models, expanding beyond English-focused studies.

Findings

01

Gender bias is present across all task-language combinations.

02

Model bias correlates with national labor market statistics.

03

Reflexivization provides a clear testbed for gender bias detection.

Abstract

The one-sided focus on English in previous studies of gender bias in NLP misses out on opportunities in other languages: English challenge datasets such as GAP and WinoGender highlight model preferences that are "hallucinatory", e.g., disambiguating gender-ambiguous occurrences of 'doctor' as male doctors. We show that for languages with type B reflexivization, e.g., Swedish and Russian, we can construct multi-task challenge datasets for detecting gender bias that lead to unambiguously wrong model predictions: In these languages, the direct translation of 'the doctor removed his mask' is not ambiguous between a coreferential reading and a disjoint reading. Instead, the coreferential reading requires a non-gendered pronoun, and the gendered, possessive pronouns are anti-reflexive. We present a multilingual, multi-task challenge dataset, which spans four languages and four NLP tasks and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.