Systematic analysis of the impact of label noise correction on ML   Fairness

I. Oliveira e Silva; C. Soares; I. Sousa; R. Ghani

arXiv:2306.15994·cs.LG·June 29, 2023

Systematic analysis of the impact of label noise correction on ML Fairness

I. Oliveira e Silva, C. Soares, I. Sousa, R. Ghani

PDF

Open Access 1 Repo

TL;DR

This paper systematically evaluates how different label noise correction methods impact fairness in machine learning models trained on biased data, providing insights into their effectiveness and trade-offs.

Contribution

It introduces an empirical methodology to assess label noise correction techniques for fairness, applying it to six methods across multiple datasets.

Findings

01

Hybrid Label Noise Correction balances fairness and accuracy well

02

Clustering-Based Correction reduces discrimination most but lowers performance

03

Methodology can be applied to fairness benchmarks and standard datasets

Abstract

Arbitrary, inconsistent, or faulty decision-making raises serious concerns, and preventing unfair models is an increasingly important challenge in Machine Learning. Data often reflect past discriminatory behavior, and models trained on such data may reflect bias on sensitive attributes, such as gender, race, or age. One approach to developing fair models is to preprocess the training data to remove the underlying biases while preserving the relevant information, for example, by correcting biased labels. While multiple label noise correction methods are available, the information about their behavior in identifying discrimination is very limited. In this work, we develop an empirical methodology to systematically evaluate the effectiveness of label noise correction techniques in ensuring the fairness of models trained on biased datasets. Our methodology involves manipulating the amount…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

reluzita/fair-lnc-evaluation
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Advanced Multi-Objective Optimization Algorithms