What the Harm? Quantifying the Tangible Impact of Gender Bias in Machine Translation with a Human-centered Study

Beatrice Savoldi; Sara Papi; Matteo Negri; Ana Guerberof-Arenas; Luisa Bentivogli

arXiv:2410.00545·cs.CL·October 8, 2024

TL;DR

This study demonstrates that gender bias in machine translation causes tangible harms, such as increased effort and costs for women, highlighting the need for human-centered evaluations over automatic metrics.

Contribution

The paper provides the first extensive human-centered analysis linking gender bias in MT to real-world costs, and shows that current automatic bias metrics do not capture the disparities observed among human post-editors.

Findings

01 Feminine post-editing requires more effort and time.

02 Bias leads to higher financial costs for women.

03 Current automatic bias measures do not reflect actual disparities.

Abstract

Gender bias in machine translation (MT) is recognized as an issue that can harm people and society. And yet, advancements in the field rarely involve people, the final MT users, or inform how they might be impacted by biased technologies. Current evaluations are often restricted to automatic methods, which offer an opaque estimate of what the downstream impact of gender disparities might be. We conduct an extensive human-centered study to examine if and to what extent bias in MT brings harms with tangible costs, such as quality of service gaps across women and men. To this aim, we collect behavioral data from 90 participants, who post-edited MT outputs to ensure correct gender translation. Across multiple datasets, languages, and types of users, our study shows that feminine post-editing demands significantly more technical and temporal effort, also corresponding to higher financial…
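The "technical effort" of post-editing mentioned in the abstract is often approximated by how many word-level edits separate the raw MT output from its post-edited version. The sketch below illustrates that idea with a plain word-level Levenshtein distance normalized by the post-edit length. It is a simplified stand-in (no shift operations, unlike full TER/HTER) and is not taken from the paper; the function names and the Italian example sentences are hypothetical.

```python
def word_edit_distance(hyp, ref):
    """Word-level Levenshtein distance between two token lists."""
    # prev[j] holds the distance between the hyp tokens seen so far and ref[:j].
    prev = list(range(len(ref) + 1))
    for i, h in enumerate(hyp, start=1):
        curr = [i]
        for j, r in enumerate(ref, start=1):
            cost = 0 if h == r else 1
            curr.append(min(prev[j] + 1,          # delete a hyp token
                            curr[j - 1] + 1,      # insert a ref token
                            prev[j - 1] + cost))  # match or substitute
        prev = curr
    return prev[-1]


def edit_rate(mt_output, post_edit):
    """Edits needed to turn the MT output into the post-edit, per post-edit word.

    Assumption: whitespace tokenization, no shift operations (simplified HTER).
    """
    hyp = mt_output.split()
    ref = post_edit.split()
    if not ref:
        return 0.0 if not hyp else float("inf")
    return word_edit_distance(hyp, ref) / len(ref)


def mean_edit_rate(pairs):
    """Average edit rate over (mt_output, post_edit) pairs."""
    rates = [edit_rate(mt, pe) for mt, pe in pairs]
    return sum(rates) / len(rates)


if __name__ == "__main__":
    # Hypothetical example: the MT system defaults to masculine forms.
    mt = "Sono un professore stanco"
    feminine_pe = "Sono una professoressa stanca"   # three words must change
    masculine_pe = "Sono un professore stanco"      # nothing to fix
    print("feminine edit rate: ", edit_rate(mt, feminine_pe))
    print("masculine edit rate:", edit_rate(mt, masculine_pe))
```

Comparing the mean edit rate of feminine and masculine post-edits over the same MT outputs gives one rough view of the effort gap; editing time and cost would need separate measurements.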

Code & Models

Repositories

bsavoldi/post-edit_guidelines (official)

Datasets

FBK-MT/gender-bias-PE (dataset, 4 downloads)

Taxonomy

Topics: Ethics and Social Impacts of AI