Man Made Language Models? Evaluating LLMs' Perpetuation of Masculine Generics Bias
Enzo Doyen, Amalia Todirascu

TL;DR
This study evaluates how large language models (LLMs) perpetuate masculine generics bias in responses to instructions, revealing that nearly 40% of responses are biased and LLMs are hesitant to use gender-fair language.
Contribution
The paper is the first to analyze LLM responses to generic instructions for gender bias, focusing on French and masculine generics, providing new insights into bias propagation.
Findings
Approximately 39.5% of responses are MG-biased.
73.1% of responses with human nouns are MG-biased.
LLMs show reluctance to use gender-fair language spontaneously.
Abstract
Large language models (LLMs) have been shown to propagate and even amplify gender bias, in English and other languages, in specific or constrained contexts. However, no studies so far have focused on gender biases conveyed by LLMs' responses to generic instructions, especially with regard to masculine generics (MG). MG are a linguistic feature found in many gender-marked languages, denoting the use of the masculine gender as a "default" or supposedly neutral gender to refer to mixed group of men and women, or of a person whose gender is irrelevant or unknown. Numerous psycholinguistics studies have shown that MG are not neutral and induce gender bias. This work aims to analyze the use of MG by both proprietary and local LLMs in responses to generic instructions and evaluate their MG bias rate. We focus on French and create a human noun database from existing lexical resources. We filter…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational and Text Analysis Methods
MethodsFocus
