Adaptive Sampling for Minimax Fair Classification

Shubhanshu Shekhar; Greg Fields; Mohammad Ghavamzadeh; Tara Javidi

arXiv:2103.00755·cs.LG·July 21, 2021·5 cites

Adaptive Sampling for Minimax Fair Classification

Shubhanshu Shekhar, Greg Fields, Mohammad Ghavamzadeh, Tara Javidi

PDF

Open Access 1 Video

TL;DR

This paper introduces an adaptive sampling method to construct training datasets that improve fairness in classifiers, providing theoretical bounds and validating effectiveness on synthetic and real-world tasks.

Contribution

It proposes a novel adaptive sampling algorithm for fair classification, with theoretical performance bounds and practical heuristics for large-scale applications.

Findings

01

Adaptive sampling improves fairness in classifiers.

02

Theoretical bounds show optimality limits of the method.

03

Experimental results demonstrate effectiveness on diverse tasks.

Abstract

Machine learning models trained on uncurated datasets can often end up adversely affecting inputs belonging to underrepresented groups. To address this issue, we consider the problem of adaptively constructing training sets which allow us to learn classifiers that are fair in a minimax sense. We first propose an adaptive sampling algorithm based on the principle of optimism, and derive theoretical bounds on its performance. We also propose heuristic extensions of this algorithm suitable for application to large scale, practical problems. Next, by deriving algorithm independent lower-bounds for a specific class of problems, we show that the performance achieved by our adaptive scheme cannot be improved in general. We then validate the benefits of adaptively constructing training sets via experiments on synthetic tasks with logistic regression classifiers, as well as on several real-world…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Adaptive Sampling for Minimax Fair Classification· slideslive

Taxonomy

TopicsImbalanced Data Classification Techniques · Privacy-Preserving Technologies in Data · Ethics and Social Impacts of AI

MethodsLogistic Regression