From 'Showgirls' to 'Performers': Fine-tuning with Gender-inclusive   Language for Bias Reduction in LLMs

Marion Bartl; Susan Leavy

arXiv:2407.04434·cs.CL·July 8, 2024

From 'Showgirls' to 'Performers': Fine-tuning with Gender-inclusive Language for Bias Reduction in LLMs

Marion Bartl, Susan Leavy

PDF

Open Access 1 Video

TL;DR

This paper introduces a method to reduce gender bias in large language models by fine-tuning them with a dataset of gender-inclusive language, leading to less stereotypical outputs.

Contribution

It develops a novel gender-inclusive fine-tuning dataset and demonstrates its effectiveness in reducing gender stereotypes in multiple LLMs.

Findings

01

Reduced gender-stereotyping tendencies in models after fine-tuning

02

Compiled a catalogue of 692 gender-exclusive and neutral terms

03

Provides a practical approach for gender inclusivity in NLP

Abstract

Gender bias is not only prevalent in Large Language Models (LLMs) and their training data, but also firmly ingrained into the structural aspects of language itself. Therefore, adapting linguistic structures within LLM training data to promote gender-inclusivity can make gender representations within the model more inclusive. The focus of our work are gender-exclusive affixes in English, such as in 'show-girl' or 'man-cave', which can perpetuate gender stereotypes and binary conceptions of gender. We use an LLM training dataset to compile a catalogue of 692 gender-exclusive terms along with gender-neutral variants and from this, develop a gender-inclusive fine-tuning dataset, the 'Tiny Heap'. Fine-tuning three different LLMs with this dataset, we observe an overall reduction in gender-stereotyping tendencies across the models. Our approach provides a practical method for enhancing gender…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

From 'Showgirls' to 'Performers': Fine-tuning with Gender-inclusive Language for Bias Reduction in LLMs· underline

Taxonomy

TopicsArtificial Intelligence in Law

MethodsFocus