Gender-Neutral Large Language Models for Medical Applications: Reducing Bias in PubMed Abstracts

Elizabeth Schaefer; Kirk Roberts

arXiv:2501.06365·cs.CL·May 29, 2025

Gender-Neutral Large Language Models for Medical Applications: Reducing Bias in PubMed Abstracts

Elizabeth Schaefer, Kirk Roberts

PDF

1 Video

TL;DR

This study develops a bias mitigation pipeline for large language models in medical literature, neutralizing gendered pronouns in PubMed abstracts to promote fairness and reduce gender bias.

Contribution

Introduces MOBERT, a BERT-based model trained on gender-neutralized PubMed abstracts, to reduce gender bias in medical language models.

Findings

01

MOBERT achieved a 70% pronoun replacement rate.

02

Comparison showed MOBERT outperformed 1965BERT with only 4%.

03

Pronoun replacement accuracy linked to occupational term frequency.

Abstract

This paper presents a pipeline for mitigating gender bias in large language models (LLMs) used in medical literature by neutralizing gendered occupational pronouns. A dataset of 379,000 PubMed abstracts from 1965-1980 was processed to identify and modify pronouns tied to professions. We developed a BERT-based model, "Modern Occupational Bias Elimination with Refined Training," or "MOBERT," trained on these neutralized abstracts, and compared its performance with "1965BERT," trained on the original dataset. MOBERT achieved a 70% inclusive replacement rate, while 1965BERT reached only 4%. A further analysis of MOBERT revealed that pronoun replacement accuracy correlated with the frequency of occupational terms in the training data. We propose expanding the dataset and refining the pipeline to improve performance and ensure more equitable language modeling in medical applications.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Gender-Neutral Large Language Models for Medical Applications: Reducing Bias in PubMed Abstracts· underline