Prisoners of Their Own Devices: How Models Induce Data Bias in Performative Prediction
Jos\'e Pombal, Pedro Saleiro, M\'ario A.T. Figueiredo, Pedro Bizarro

TL;DR
This paper explores how machine learning models in performative prediction settings can induce and perpetuate data biases, affecting fairness and performance, with a focus on dynamic data interactions and real-world implications.
Contribution
It introduces a taxonomy for bias in performative prediction and analyzes how model behavior influences data bias, using a fraud detection case study.
Findings
Biases like distribution shifts impact fairness and accuracy.
Model interactions can reinforce existing biases.
Performative prediction requires new fairness considerations.
Abstract
The unparalleled ability of machine learning algorithms to learn patterns from data also enables them to incorporate biases embedded within. A biased model can then make decisions that disproportionately harm certain groups in society. Much work has been devoted to measuring unfairness in static ML environments, but not in dynamic, performative prediction ones, in which most real-world use cases operate. In the latter, the predictive model itself plays a pivotal role in shaping the distribution of the data. However, little attention has been heeded to relating unfairness to these interactions. Thus, to further the understanding of unfairness in these settings, we propose a taxonomy to characterize bias in the data, and study cases where it is shaped by model behaviour. Using a real-world account opening fraud detection case study as an example, we study the dangers to both performance…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Ethics and Social Impacts of AI
