Who's the (Multi-)Fairest of Them All: Rethinking Interpolation-Based   Data Augmentation Through the Lens of Multicalibration

Karina Halevy; Karly Hou; Charumathi Badrinath

arXiv:2412.10575·cs.LG·April 16, 2025

Who's the (Multi-)Fairest of Them All: Rethinking Interpolation-Based Data Augmentation Through the Lens of Multicalibration

Karina Halevy, Karly Hou, Charumathi Badrinath

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper critically examines interpolation-based data augmentation methods like Fair Mixup, revealing that vanilla Mixup often outperforms them in fairness and accuracy, especially when combined with multicalibration post-processing.

Contribution

It provides a rigorous evaluation of data augmentation methods using multicalibration, showing that simpler vanilla Mixup can outperform more complex fairness-oriented methods.

Findings

01

Vanilla Mixup outperforms Fair Mixup in fairness and accuracy.

02

Multicalibration post-processing can further improve fairness.

03

Fair Mixup often worsens performance compared to baseline.

Abstract

Data augmentation methods, especially SoTA interpolation-based methods such as Fair Mixup, have been widely shown to increase model fairness. However, this fairness is evaluated on metrics that do not capture model uncertainty and on datasets with only one, relatively large, minority group. As a remedy, multicalibration has been introduced to measure fairness while accommodating uncertainty and accounting for multiple minority groups. However, existing methods of improving multicalibration involve reducing initial training data to create a holdout set for post-processing, which is not ideal when minority training data is already sparse. This paper uses multicalibration to more rigorously examine data augmentation for classification fairness. We stress-test four versions of Fair Mixup on two structured data classification problems with up to 81 marginalized groups, evaluating…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

enscma2/fairest-mixup
pytorchOfficial

Videos

Who’s the (Multi-)Fairest of Them All: Rethinking Interpolation-Based Data Augmentation Through the Lens of Multicalibration· underline

Taxonomy

TopicsAdvanced Data Compression Techniques

MethodsSparse Evolutionary Training · Mixup