Fine-Tuned LLMs are "Time Capsules" for Tracking Societal Bias Through   Books

Sangmitra Madhusudan; Robert Morabito; Skye Reid; Nikta Gohari Sadr,; and Ali Emami

arXiv:2502.05331·cs.CL·February 14, 2025

Fine-Tuned LLMs are "Time Capsules" for Tracking Societal Bias Through Books

Sangmitra Madhusudan, Robert Morabito, Skye Reid, Nikta Gohari Sadr,, and Ali Emami

PDF

Open Access 1 Repo 2 Videos

TL;DR

This paper introduces BookPAGE, a novel method using fine-tuned LLMs on decade-specific books to track societal bias evolution over time, revealing how biases reflect historical cultural shifts.

Contribution

We develop a new corpus and methodology to analyze societal biases in LLMs based on literary content, highlighting bias trends aligned with historical societal changes.

Findings

01

Biases in LLMs mirror societal changes over decades.

02

Portrayal of women in leadership roles increased over time.

03

References to LGBTQ+ relationships grew significantly from 1980s to 2000s.

Abstract

Books, while often rich in cultural insights, can also mirror societal biases of their eras - biases that Large Language Models (LLMs) may learn and perpetuate during training. We introduce a novel method to trace and quantify these biases using fine-tuned LLMs. We develop BookPAGE, a corpus comprising 593 fictional books across seven decades (1950-2019), to track bias evolution. By fine-tuning LLMs on books from each decade and using targeted prompts, we examine shifts in biases related to gender, sexual orientation, race, and religion. Our findings indicate that LLMs trained on decade-specific books manifest biases reflective of their times, with both gradual trends and notable shifts. For example, model responses showed a progressive increase in the portrayal of women in leadership roles (from 8% to 22%) from the 1950s to 2010s, with a significant uptick in the 1990s (from 4% to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Sangmitra-06/BookPAGE
noneOfficial

Videos

Fine-Tuned LLMs are "Time Capsules'' for Tracking Societal Bias Through Books· underline

Fine-Tuned LLMs are ''Time Capsules'' for Tracking Societal Bias Through Books· underline

Taxonomy

TopicsHate Speech and Cyberbullying Detection