Watching the Watchers: Exposing Gender Disparities in Machine Translation Quality Estimation
Emmanouil Zaranis, Giuseppe Attanasio, Sweta Agrawal, Andr\'e F.T. Martins

TL;DR
This paper investigates gender bias in machine translation quality estimation metrics, revealing significant biases that favor masculine forms and impact downstream translation quality and data filtering.
Contribution
It introduces the first comprehensive analysis of gender bias in QE metrics and demonstrates its effects across multiple languages and domains.
Findings
Masculine translations score higher than feminine ones when gender is undisclosed.
Gender-neutral translations are penalized by biased QE metrics.
Context-aware QE metrics show more errors for feminine referents.
Abstract
Quality estimation (QE)-the automatic assessment of translation quality-has recently become crucial across several stages of the translation pipeline, from data curation to training and decoding. While QE metrics have been optimized to align with human judgments, whether they encode social biases has been largely overlooked. Biased QE risks favoring certain demographic groups over others, e.g., by exacerbating gaps in visibility and usability. This paper defines and investigates gender bias of QE metrics and discusses its downstream implications for machine translation (MT). Experiments with state-of-the-art QE metrics across multiple domains, datasets, and languages reveal significant bias. When a human entity's gender in the source is undisclosed, masculine-inflected translations score higher than feminine-inflected ones, and gender-neutral translations are penalized. Even when…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Text Readability and Simplification
MethodsSoftmax · Attention Is All You Need · Focus · ALIGN
