When Are Learning Biases Equivalent? A Unifying Framework for Fairness, Robustness, and Distribution Shift
Sushant Mehta

TL;DR
This paper introduces a unifying theoretical framework that characterizes when different bias mechanisms in machine learning, such as fairness violations and distribution shifts, produce equivalent effects on model performance, validated across multiple datasets.
Contribution
It formalizes biases as violations of conditional independence using information theory, establishing conditions for their equivalence and enabling transfer of debiasing methods across domains.
Findings
Equivalence between spurious correlation strength and sub-population imbalance ratio.
Validation of theoretical predictions across six datasets and three architectures.
Approximate 3% worst-group accuracy difference within predicted equivalence.
Abstract
Machine learning systems exhibit diverse failure modes: unfairness toward protected groups, brittleness to spurious correlations, poor performance on minority sub-populations, which are typically studied in isolation by distinct research communities. We propose a unifying theoretical framework that characterizes when different bias mechanisms produce quantitatively equivalent effects on model performance. By formalizing biases as violations of conditional independence through information-theoretic measures, we prove formal equivalence conditions relating spurious correlations, subpopulation shift, class imbalance, and fairness violations. Our theory predicts that a spurious correlation of strength produces equivalent worst-group accuracy degradation as a sub-population imbalance ratio under feature overlap assumptions. Empirical validation in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEthics and Social Impacts of AI · Adversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI)
