TL;DR
This paper introduces a multilingual, multi-task challenge dataset based on type B reflexivization in Swedish and Russian to detect gender bias in NLP models, revealing biases across languages and tasks.
Contribution
It presents a novel challenge dataset leveraging reflexivization in multiple languages to unambiguously detect gender bias in NLP models, expanding beyond English-focused studies.
Findings
Gender bias is present across all task-language combinations.
Model bias correlates with national labor market statistics.
Reflexivization provides a clear testbed for gender bias detection.
Abstract
The one-sided focus on English in previous studies of gender bias in NLP misses out on opportunities in other languages: English challenge datasets such as GAP and WinoGender highlight model preferences that are "hallucinatory", e.g., disambiguating gender-ambiguous occurrences of 'doctor' as male doctors. We show that for languages with type B reflexivization, e.g., Swedish and Russian, we can construct multi-task challenge datasets for detecting gender bias that lead to unambiguously wrong model predictions: In these languages, the direct translation of 'the doctor removed his mask' is not ambiguous between a coreferential reading and a disjoint reading. Instead, the coreferential reading requires a non-gendered pronoun, and the gendered, possessive pronouns are anti-reflexive. We present a multilingual, multi-task challenge dataset, which spans four languages and four NLP tasks and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
