GeNRe: A French Gender-Neutral Rewriting System Using Collective Nouns
Enzo Doyen, Amalia Todirascu

TL;DR
This paper introduces GeNRe, the first French gender-neutral rewriting system utilizing collective nouns, combining rule-based and language model approaches to mitigate gender bias in NLP.
Contribution
It presents a novel French gender-neutral rewriting system using collective nouns, with a rule-based core and fine-tuned language models, including the use of instruct-based models for improved performance.
Findings
Claude 3 Opus, paired with a dictionary, performs close to the rule-based system.
The system neutralizes masculine generics in French texts by rewriting them with collective nouns.
It combines rule-based and model-based approaches.
Abstract
A significant portion of the textual data used in the field of Natural Language Processing (NLP) exhibits gender biases, particularly due to the use of masculine generics (masculine words that are supposed to refer to mixed groups of men and women), which can perpetuate and amplify stereotypes. Gender rewriting, an NLP task that involves automatically detecting and replacing gendered forms with neutral or opposite forms (e.g., from masculine to feminine), can be employed to mitigate these biases. While such systems have been developed in a number of languages (English, Arabic, Portuguese, German, French), automatic use of gender neutralization techniques (as opposed to inclusive or gender-switching techniques) has only been studied for English. This paper presents GeNRe, the very first French gender-neutral rewriting system using collective nouns, which are gender-fixed in French. We…
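To make the idea of rewriting with collective nouns concrete, here is a minimal Python sketch of a dictionary lookup that swaps a masculine plural noun phrase for a gender-fixed collective noun. It is not the GeNRe implementation: the `COLLECTIVES` entries, the function names, and the restriction to the determiner "les" are all assumptions made for illustration.

```python
import re

# Hypothetical toy dictionary: masculine plural noun -> (collective noun, gender).
# These entries are illustrative assumptions, not taken from the GeNRe resources.
COLLECTIVES = {
    "soldats": ("armée", "f"),
    "policiers": ("police", "f"),
    "professeurs": ("corps enseignant", "m"),
}

# Initial letters that trigger elision of the article (h is left out on purpose,
# because an aspirated h does not elide).
VOWELS = "aeiouyâàéèêëîïôùûü"


def definite_article(noun: str, gender: str) -> str:
    """Return the singular definite article, elided before a vowel."""
    if noun[0].lower() in VOWELS:
        return "l'"
    return "la " if gender == "f" else "le "


def neutralize(sentence: str) -> str:
    """Replace 'les <masculine plural>' with a singular collective noun phrase."""

    def swap(match: re.Match) -> str:
        noun = match.group(2).lower()
        if noun not in COLLECTIVES:
            return match.group(0)  # unknown noun: leave the text untouched
        collective, gender = COLLECTIVES[noun]
        phrase = definite_article(collective, gender) + collective
        if match.group(1)[0].isupper():  # keep sentence-initial capitalization
            phrase = phrase[0].upper() + phrase[1:]
        return phrase

    return re.sub(r"\b(les)\s+(\w+)", swap, sentence, flags=re.IGNORECASE)


print(neutralize("Il admire les policiers."))
print(neutralize("Les soldats"))
```

The sketch only swaps the noun phrase. Because the collective noun is singular, a usable system also has to re-inflect dependent verbs, adjectives, and pronouns, which this example does not attempt.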
Taxonomy
Topics: Natural Language Processing Techniques · Authorship Attribution and Profiling · Topic Modeling
