Uncertainty Quantification for Evaluating Machine Translation Bias

Ieva Raminta Stali\=unait\.e; Julius Cheng; Andreas Vlachos

arXiv:2507.18338·cs.CL·October 22, 2025

Uncertainty Quantification for Evaluating Machine Translation Bias

Ieva Raminta Stali\=unait\.e, Julius Cheng, Andreas Vlachos

PDF

Open Access

TL;DR

This paper explores how uncertainty quantification in machine translation can be used to detect gender bias, especially in ambiguous cases, revealing that high accuracy does not imply proper bias mitigation.

Contribution

It introduces a novel approach using semantic uncertainty to measure gender bias in MT, applicable to ambiguous and unambiguous sentences, and analyzes the impact of debiasing.

Findings

01

Uncertainty can reveal bias in ambiguous translations.

02

High accuracy does not guarantee unbiased translations.

03

Debiasing affects ambiguous and unambiguous cases differently.

Abstract

The predictive uncertainty of machine translation (MT) models is typically used as a quality estimation proxy. In this work, we posit that apart from confidently translating when a single correct translation exists, models should also maintain uncertainty when the input is ambiguous. We use uncertainty to measure gender bias in MT systems. When the source sentence includes a lexeme whose gender is not overtly marked, but whose target-language equivalent requires gender specification, the model must infer the appropriate gender from the context and can be susceptible to biases. Prior work measured bias via gender accuracy, however it cannot be applied to ambiguous cases. Using semantic uncertainty, we are able to assess bias when translating both ambiguous and unambiguous source sentences, and find that high translation accuracy does not correlate with exhibiting uncertainty…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFuzzy Logic and Control Systems