On Evaluating and Mitigating Gender Biases in Multilingual Settings
Aniket Vashishtha, Kabir Ahuja, Sunayana Sitaram

TL;DR
This paper develops a benchmark and evaluates bias mitigation techniques for gender biases in multilingual language models, focusing on Indian languages, addressing the lack of resources beyond English.
Contribution
It introduces a new benchmark for gender bias evaluation in Indian languages and extends debiasing methods to multilingual models, filling a critical resource gap.
Findings
Created a human-annotated bias evaluation benchmark for Indian languages
Extended debiasing techniques to work with multilingual models
Identified challenges in bias mitigation across diverse languages
Abstract
While understanding and removing gender biases in language models has been a long-standing problem in Natural Language Processing, prior research work has primarily been limited to English. In this work, we investigate some of the challenges with evaluating and mitigating biases in multilingual settings which stem from a lack of existing benchmarks and resources for bias evaluation beyond English especially for non-western context. In this paper, we first create a benchmark for evaluating gender biases in pre-trained masked language models by extending DisCo to different Indian languages using human annotations. We extend various debiasing methods to work beyond English and evaluate their effectiveness for SOTA massively multilingual models on our proposed metric. Overall, our work highlights the challenges that arise while studying social biases in multilingual settings and provides…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsText Readability and Simplification
