Directional Bias Amplification
Angelina Wang, Olga Russakovsky

TL;DR
This paper introduces a new metric called Directional Bias Amplification to better measure how machine learning models amplify biases from data, addressing shortcomings of previous metrics and discussing its implications.
Contribution
The paper proposes a decoupled, more accurate metric for bias amplification and analyzes its assumptions, limitations, and normative implications.
Findings
The new metric better isolates bias amplification effects.
It highlights the importance of confidence intervals in measurement.
The analysis discusses the normative considerations of bias measurement.
Abstract
Mitigating bias in machine learning systems requires refining our understanding of bias propagation pathways: from societal structures to large-scale data to trained models to impact on society. In this work, we focus on one aspect of the problem, namely bias amplification: the tendency of models to amplify the biases present in the data they are trained on. A metric for measuring bias amplification was introduced in the seminal work by Zhao et al. (2017); however, as we demonstrate, this metric suffers from a number of shortcomings including conflating different types of bias amplification and failing to account for varying base rates of protected attributes. We introduce and analyze a new, decoupled metric for measuring bias amplification, (Directional Bias Amplification). We thoroughly analyze and discuss both the technical assumptions and normative…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI) · Ethics and Social Impacts of AI
