Are We Paying Attention to Her? Investigating Gender Disambiguation and Attention in Machine Translation

Chiara Manna; Afra Alishahi; Fr\'ed\'eric Blain; Eva Vanmassenhove

arXiv:2505.08546·cs.CL·May 14, 2025

Are We Paying Attention to Her? Investigating Gender Disambiguation and Attention in Machine Translation

Chiara Manna, Afra Alishahi, Fr\'ed\'eric Blain, Eva Vanmassenhove

PDF

1 Repo

TL;DR

This paper introduces Minimal Pair Accuracy (MPA), a new metric to evaluate how well neural machine translation models incorporate gender cues, revealing biases and attention patterns related to gender disambiguation.

Contribution

The paper proposes MPA as a novel evaluation metric for gender disambiguation in NMT, and analyzes model biases and attention mechanisms regarding gender cues in translation.

Findings

01

Models often ignore gender cues in favor of stereotypes.

02

Anti-stereotypical cases show models favor masculine cues.

03

Attention analysis reveals different responses to masculine and feminine cues.

Abstract

While gender bias in modern Neural Machine Translation (NMT) systems has received much attention, traditional evaluation metrics do not to fully capture the extent to which these systems integrate contextual gender cues. We propose a novel evaluation metric called Minimal Pair Accuracy (MPA), which measures the reliance of models on gender cues for gender disambiguation. MPA is designed to go beyond surface-level gender accuracy metrics by focusing on whether models adapt to gender cues in minimal pairs -- sentence pairs that differ solely in the gendered pronoun, namely the explicit indicator of the target's entity gender in the source language (EN). We evaluate a number of NMT models on the English-Italian (EN--IT) language pair using this metric, we show that they ignore available gender cues in most cases in favor of (statistical) stereotypical gender interpretation. We further show…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

chiaramanna/gender-cue-integration-MT
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSoftmax · Attention Is All You Need