Sample Selection for Fair and Robust Training

Yuji Roh; Kangwook Lee; Steven Euijong Whang; Changho Suh

arXiv:2110.14222·cs.LG·October 28, 2021·1 cites

Sample Selection for Fair and Robust Training

Yuji Roh, Kangwook Lee, Steven Euijong Whang, Changho Suh

PDF

Open Access 1 Video

TL;DR

This paper introduces a greedy sample selection algorithm that enhances fairness and robustness in AI training, effectively balancing unbiased learning with data corruption resilience without altering existing training procedures.

Contribution

It formulates a novel combinatorial optimization problem for fair and robust sample selection and proposes an efficient greedy algorithm that outperforms or matches state-of-the-art methods.

Findings

01

Achieves better or comparable fairness and robustness on synthetic and real datasets.

02

Operates without changing the core training algorithm or requiring additional clean data.

Abstract

Fairness and robustness are critical elements of Trustworthy AI that need to be addressed together. Fairness is about learning an unbiased model while robustness is about learning from corrupted data, and it is known that addressing only one of them may have an adverse affect on the other. In this work, we propose a sample selection-based algorithm for fair and robust training. To this end, we formulate a combinatorial optimization problem for the unbiased selection of samples in the presence of data corruption. Observing that solving this optimization problem is strongly NP-hard, we propose a greedy algorithm that is efficient and effective in practice. Experiments show that our algorithm obtains fairness and robustness that are better than or comparable to the state-of-the-art technique, both on synthetic and benchmark real datasets. Moreover, unlike other fair and robust training…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Sample Selection for Fair and Robust Training· slideslive

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Ethics and Social Impacts of AI · Explainable Artificial Intelligence (XAI)