Bias Dynamics in BabyLMs: Towards a Compute-Efficient Sandbox for Democratising Pre-Training Debiasing

Filip Trhlik; Andrew Caines; Paula Buttery

arXiv:2601.09421·cs.CL·January 16, 2026

Bias Dynamics in BabyLMs: Towards a Compute-Efficient Sandbox for Democratising Pre-Training Debiasing

Filip Trhlik, Andrew Caines, Paula Buttery

PDF

Open Access 1 Datasets

TL;DR

This paper demonstrates that small, low-cost BabyLMs can effectively model bias formation in large language models, enabling efficient pre-training debiasing research and reducing costs significantly.

Contribution

It introduces BabyLMs as a compute-efficient sandbox that closely mimics bias dynamics of larger models, facilitating democratized and faster debiasing research.

Findings

01

BabyLMs show similar bias formation patterns to BERT models.

02

Correlations between BabyLMs and BERT hold across debiasing methods.

03

Pre-training costs are reduced from over 500 to under 30 GPU-hours.

Abstract

Pre-trained language models (LMs) have, over the last few years, grown substantially in both societal adoption and training costs. This rapid growth in size has constrained progress in understanding and mitigating their biases. Since re-training LMs is prohibitively expensive, most debiasing work has focused on post-hoc or masking-based strategies, which often fail to address the underlying causes of bias. In this work, we seek to democratise pre-model debiasing research by using low-cost proxy models. Specifically, we investigate BabyLMs, compact BERT-like models trained on small and mutable corpora that can approximate bias acquisition and learning dynamics of larger models. We show that BabyLMs display closely aligned patterns of intrinsic bias formation and performance development compared to standard BERT models, despite their drastically reduced size. Furthermore, correlations…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

suchirsalhan/babylm-detox
dataset· 15 dl
15 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsArtificial Intelligence in Healthcare and Education · Ethics and Social Impacts of AI · Topic Modeling