Leveraging Large Language Models to Measure Gender Representation Bias in Gendered Language Corpora
Erik Derner, Sara Sansalvador de la Fuente, Yoan Guti\'errez, Paloma Moreda, Nuria Oliver

TL;DR
This paper introduces a novel method using large language models to detect and quantify gender representation bias in training corpora, especially in gendered languages, revealing significant male dominance and potential mitigation strategies.
Contribution
The paper presents a new LLM-based approach for measuring gender representation bias in multilingual corpora, addressing a gap in existing bias detection methods for gendered languages.
Findings
Substantial male-dominant imbalances in training data
Bias in training data influences model outputs
Bias can be mitigated with small-scale counter-bias training
Abstract
Large language models (LLMs) often inherit and amplify social biases embedded in their training data. A prominent social bias is gender bias. In this regard, prior work has mainly focused on gender stereotyping bias - the association of specific roles or traits with a particular gender - in English and on evaluating gender bias in model embeddings or generated outputs. In contrast, gender representation bias - the unequal frequency of references to individuals of different genders - in the training corpora has received less attention. Yet such imbalances in the training data constitute an upstream source of bias that can propagate and intensify throughout the entire model lifecycle. To fill this gap, we propose a novel LLM-based method to detect and quantify gender representation bias in LLM training data in gendered languages, where grammatical gender challenges the applicability of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGender Studies in Language
