Understanding Unfairness in Fraud Detection through Model and Data Bias Interactions
Jos\'e Pombal, Andr\'e F. Cruz, Jo\~ao Bravo, Pedro Saleiro, M\'ario, A.T. Figueiredo, Pedro Bizarro

TL;DR
This paper investigates how interactions between models and data biases contribute to unfairness in fraud detection, revealing specific trade-offs and intervention strategies for different bias scenarios.
Contribution
It introduces a taxonomy of data bias and analyzes fairness-accuracy trade-offs, highlighting the importance of model-data interactions in algorithmic unfairness.
Findings
Bias interactions significantly influence fairness outcomes.
Pre-processing can balance errors in simple bias settings.
Complex biases require more sophisticated mitigation techniques.
Abstract
In recent years, machine learning algorithms have become ubiquitous in a multitude of high-stakes decision-making applications. The unparalleled ability of machine learning algorithms to learn patterns from data also enables them to incorporate biases embedded within. A biased model can then make decisions that disproportionately harm certain groups in society -- limiting their access to financial services, for example. The awareness of this problem has given rise to the field of Fair ML, which focuses on studying, measuring, and mitigating unfairness in algorithmic prediction, with respect to a set of protected groups (e.g., race or gender). However, the underlying causes for algorithmic unfairness still remain elusive, with researchers divided between blaming either the ML algorithms or the data they are trained on. In this work, we maintain that algorithmic unfairness stems from…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCorruption and Economic Development · Ethics and Social Impacts of AI · Imbalanced Data Classification Techniques
