Bias Dynamics in BabyLMs: Towards a Compute-Efficient Sandbox for Democratising Pre-Training Debiasing
Filip Trhlik, Andrew Caines, Paula Buttery

TL;DR
This paper demonstrates that small, low-cost BabyLMs can effectively model bias formation in large language models, enabling efficient pre-training debiasing research and reducing costs significantly.
Contribution
It introduces BabyLMs as a compute-efficient sandbox that closely mimics bias dynamics of larger models, facilitating democratized and faster debiasing research.
Findings
BabyLMs show similar bias formation patterns to BERT models.
Correlations between BabyLMs and BERT hold across debiasing methods.
Pre-training costs are reduced from over 500 to under 30 GPU-hours.
Abstract
Pre-trained language models (LMs) have, over the last few years, grown substantially in both societal adoption and training costs. This rapid growth in size has constrained progress in understanding and mitigating their biases. Since re-training LMs is prohibitively expensive, most debiasing work has focused on post-hoc or masking-based strategies, which often fail to address the underlying causes of bias. In this work, we seek to democratise pre-model debiasing research by using low-cost proxy models. Specifically, we investigate BabyLMs, compact BERT-like models trained on small and mutable corpora that can approximate bias acquisition and learning dynamics of larger models. We show that BabyLMs display closely aligned patterns of intrinsic bias formation and performance development compared to standard BERT models, despite their drastically reduced size. Furthermore, correlations…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Healthcare and Education · Ethics and Social Impacts of AI · Topic Modeling
