An Effective Theory of Bias Amplification
Arjun Subramonian, Samuel J. Bell, Levent Sagun, Elvis Dohmatob

TL;DR
This paper develops a rigorous theoretical framework for understanding bias amplification in machine learning, specifically in ridge regression models, revealing how model choices and data properties influence bias and test performance across groups.
Contribution
It provides the first precise analytical theory explaining bias amplification phenomena in ridge regression models, connecting model design, data distribution, and bias outcomes.
Findings
Optimal regularization can prevent bias amplification
Test error disparities between groups can persist despite increased parameters
Theory aligns with empirical observations on synthetic datasets
Abstract
Machine learning models can capture and amplify biases present in data, leading to disparate test performance across social groups. To better understand, evaluate, and mitigate these biases, a deeper theoretical understanding of how model design choices and data distribution properties contribute to bias is needed. In this work, we contribute a precise analytical theory in the context of ridge regression, both with and without random projections, where the former models feedforward neural networks in a simplified regime. Our theory offers a unified and rigorous explanation of machine learning bias, providing insights into phenomena such as bias amplification and minority-group bias in various feature and parameter regimes. For example, we observe that there may be an optimal regularization penalty or training time to avoid bias amplification, and there can be differences in test error…
Peer Reviews
Decision·ICLR 2025 Poster
The proposed theory is solid with the exact formulation of the limiting risk, and it provides a rigorous foundation on the bias amplification phenomenon and its relationship with various components in machine learning models. The discovery is novel and the contribution is important to deepen our understanding of the bias amplification in ML.
1. Although the authors present the exact formulation of the risk in the main text, it is complicated to understand the implications of those formulas. It would be helpful to include more discussion to explain each term to better understand the results. 2. The paper's main contribution is to examine the bias amplification phenomenon using the formula. However, a formal statement about how different components affect the bias amplification is lacking. I would suggest the authors write them in f
This paper examines bias amplification in ridge regression with random projections. Theoretical findings not only predict known bias amplification effects but also reveal new amplification phenomena under isotropic covariance. These findings are further validated through empirical analysis on a complex semi-synthetic image dataset.
- It appears that the analysis of bias amplification heavily relies on an implicit assumption about the conditional dependence heterogeneity (e.g., $w_2\neq w_1$) between the two groups. What would happen if this assumption were relaxed or removed? - The manuscript does not sufficiently justify the definition of “bias amplification.” I think more theoretical backing is needed for this definition. - More theoretical justification is needed to demonstrate that ridge regression with random proje
I believe the topic analyzed in this paper is very relevant, and theoretical contributions in this setting are crucial to understanding how to mitigate the pressing issue of bias amplification. In this context, it is interesting to understand and discuss whether the usual recipe of overparametrization could be more harmful than expected. The authors' analysis seems technically sound, although the methods required the introduction of some modeling assumptions that might be simplistic in some cas
My main concern about this work is its extreme similarity, in goals, modeling setup, and some of the conclusions, with previous work ("Bias-inducing geometries: an exactly solvable data model with fairness implications" https://arxiv.org/abs/2205.15935), which is ignored in the present paper. Notice that this same reference is for example cited in ref. (Bell & Sagun, 2023) within this paper. While I can recognize some differences, reasonably motivating the present extension, there should be a cl
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Generative Adversarial Networks and Image Synthesis · Explainable Artificial Intelligence (XAI)
MethodsALIGN
