ProxiMix: Enhancing Fairness with Proximity Samples in Subgroups

Jingyu Hu; Jun Hong; Mengnan Du; Weiru Liu

arXiv:2410.01145·cs.LG·October 3, 2024

ProxiMix: Enhancing Fairness with Proximity Samples in Subgroups

Jingyu Hu, Jun Hong, Mengnan Du, Weiru Liu

PDF

Open Access

TL;DR

ProxiMix is a novel data augmentation method that improves fairness in machine learning by maintaining proximity relationships during label generation, effectively reducing biases across multiple datasets and models.

Contribution

It introduces ProxiMix, a new pre-processing strategy combining mixup and proximity-aware label generation to enhance fairness in ML models.

Findings

01

ProxiMix improves fairness in predictions.

02

ProxiMix enhances fairness of recourse.

03

Effective across various datasets and models.

Abstract

Many bias mitigation methods have been developed for addressing fairness issues in machine learning. We found that using linear mixup alone, a data augmentation technique, for bias mitigation, can still retain biases present in dataset labels. Research presented in this paper aims to address this issue by proposing a novel pre-processing strategy in which both an existing mixup method and our new bias mitigation algorithm can be utilized to improve the generation of labels of augmented samples, which are proximity aware. Specifically, we proposed ProxiMix which keeps both pairwise and proximity relationships for fairer data augmentation. We conducted thorough experiments with three datasets, three ML models, and different hyperparameters settings. Our experimental results showed the effectiveness of ProxiMix from both fairness of predictions and fairness of recourse perspectives.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEthics and Social Impacts of AI · Bayesian Modeling and Causal Inference · Explainable Artificial Intelligence (XAI)

MethodsMixup