FairQE: Multi-Agent Framework for Mitigating Gender Bias in Translation Quality Estimation
Jinhee Jang, Juhwan Choi, Dongjin Lee, Seunguk Yu, Youngbin Kim

TL;DR
FairQE is a multi-agent framework that reduces gender bias in translation quality estimation by detecting gender cues, generating variants, and combining scores with bias-mitigating reasoning, improving fairness without losing accuracy.
Contribution
Introduces FairQE, a novel multi-agent, fairness-aware QE framework that effectively mitigates gender bias in translation evaluation while maintaining strong performance.
Findings
FairQE improves gender fairness over strong QE baselines.
It achieves competitive or better overall QE performance.
The framework effectively reduces gender bias in various evaluation settings.
Abstract
Quality Estimation (QE) aims to assess machine translation quality without reference translations, but recent studies have shown that existing QE models exhibit systematic gender bias. In particular, they tend to favor masculine realizations in gender-ambiguous contexts and may assign higher scores to gender-misaligned translations even when gender is explicitly specified. To address these issues, we propose FairQE, a multi-agent-based, fairness-aware QE framework that mitigates gender bias in both gender-ambiguous and gender-explicit scenarios. FairQE detects gender cues, generates gender-flipped translation variants, and combines conventional QE scores with LLM-based bias-mitigating reasoning through a dynamic bias-aware aggregation mechanism. This design preserves the strengths of existing QE models while calibrating their gender-related biases in a plug-and-play manner. Extensive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
