Mitigating Bias in Queer Representation within Large Language Models: A   Collaborative Agent Approach

Tianyi Huang; Arya Somasundaram

arXiv:2411.07656·cs.CL·December 3, 2024

Mitigating Bias in Queer Representation within Large Language Models: A Collaborative Agent Approach

Tianyi Huang, Arya Somasundaram

PDF

Open Access 1 Repo

TL;DR

This paper presents a collaborative agent framework to reduce gender bias in large language models, significantly improving inclusive pronoun usage and promoting fairness in AI-generated content.

Contribution

It introduces a multi-agent pipeline for bias detection and correction in pronoun usage, advancing methods for inclusive language in LLM outputs.

Findings

01

Achieved a 32.6 percentage point increase in correct inclusive pronoun classification.

02

Significantly reduced inappropriate gendered pronoun usage in LLM outputs.

03

Demonstrated the effectiveness of agent-driven frameworks in promoting fairness.

Abstract

Large Language Models (LLMs) often perpetuate biases in pronoun usage, leading to misrepresentation or exclusion of queer individuals. This paper addresses the specific problem of biased pronoun usage in LLM outputs, particularly the inappropriate use of traditionally gendered pronouns ("he," "she") when inclusive language is needed to accurately represent all identities. We introduce a collaborative agent pipeline designed to mitigate these biases by analyzing and optimizing pronoun usage for inclusivity. Our multi-agent framework includes specialized agents for both bias detection and correction. Experimental evaluations using the Tango dataset-a benchmark focused on gender pronoun usage-demonstrate that our approach significantly improves inclusive pronoun classification, achieving a 32.6 percentage point increase over GPT-4o in correctly disagreeing with inappropriate traditionally…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

tonyhrule/queer-bias-llms
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques