Data Augmentation via Subgroup Mixup for Improving Fairness

Madeline Navarro; Camille Little; Genevera I. Allen; Santiago Segarra

arXiv:2309.07110·stat.ML·September 14, 2023

Data Augmentation via Subgroup Mixup for Improving Fairness

Madeline Navarro, Camille Little, Genevera I. Allen, Santiago Segarra

PDF

Open Access

TL;DR

This paper introduces a novel data augmentation technique called subgroup mixup, designed to enhance fairness in machine learning by balancing underrepresented groups and improving decision boundaries across diverse subpopulations.

Contribution

The paper presents a new pairwise mixup method for data augmentation that specifically targets fairness improvements in classification tasks, addressing societal biases and under-representation.

Findings

01

Achieves fair outcomes on synthetic and real-world data

02

Improves fairness without sacrificing accuracy

03

Demonstrates robustness across multiple datasets

Abstract

In this work, we propose data augmentation via pairwise mixup across subgroups to improve group fairness. Many real-world applications of machine learning systems exhibit biases across certain groups due to under-representation or training data that reflects societal biases. Inspired by the successes of mixup for improving classification performance, we develop a pairwise mixup scheme to augment training data and encourage fair and accurate decision boundaries for all subgroups. Data augmentation for group fairness allows us to add new samples of underrepresented groups to balance subpopulations. Furthermore, our method allows us to use the generalization ability of mixup to improve both fairness and accuracy. We compare our proposed mixup to existing data augmentation and bias mitigation approaches on both synthetic simulations and real-world benchmark fair classification data,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEthics and Social Impacts of AI

MethodsMixup