No evaluation without fair representation : Impact of label and selection bias on the evaluation, performance and mitigation of classification models
Magali Legast, Toon Calders, Fran\c{c}ois Fouss

TL;DR
This paper empirically studies how label and selection biases affect the evaluation, performance, and mitigation of classification models, introducing a framework to model and analyze biased datasets for fairer assessment.
Contribution
It introduces a biasing and evaluation framework to model fair and biased worlds, enabling controlled analysis of bias impacts on model evaluation and mitigation methods.
Findings
Bias type influences model performance impact.
No trade-off between fairness and accuracy on unbiased test sets.
Bias mitigation effectiveness varies with bias type.
Abstract
Bias can be introduced in diverse ways in machine learning datasets, for example via selection or label bias. Although these bias types in themselves have an influence on important aspects of fair machine learning, their different impact has been understudied. In this work, we empirically analyze the effect of label bias and several subtypes of selection bias on the evaluation of classification models, on their performance, and on the effectiveness of bias mitigation methods. We also introduce a biasing and evaluation framework that allows to model fair worlds and their biased counterparts through the introduction of controlled bias in real-life datasets with low discrimination. Using our framework, we empirically analyze the impact of each bias type independently, while obtaining a more representative evaluation of models and mitigation methods than with the traditional use of a subset…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEthics and Social Impacts of AI · Explainable Artificial Intelligence (XAI) · Machine Learning and Data Classification
