On Adversarial Bias and the Robustness of Fair Machine Learning

Hongyan Chang; Ta Duy Nguyen; Sasi Kumar Murakonda; Ehsan Kazemi; Reza; Shokri

arXiv:2006.08669·stat.ML·June 17, 2020·37 cites

On Adversarial Bias and the Robustness of Fair Machine Learning

Hongyan Chang, Ta Duy Nguyen, Sasi Kumar Murakonda, Ehsan Kazemi, Reza, Shokri

PDF

Open Access 1 Repo

TL;DR

This paper investigates how adversarial attacks can compromise the robustness of fair machine learning models, especially those using equalized odds, revealing vulnerabilities that can reduce accuracy and fairness despite fairness constraints.

Contribution

It provides an analysis of data poisoning attacks against group-based fair ML, highlighting conflicts between fairness and robustness, with empirical evaluation across multiple algorithms and datasets.

Findings

01

Adversarial sampling and labeling attacks can significantly reduce test accuracy.

02

Such attacks can increase the fairness gap on test data.

03

Fair models can still be vulnerable despite satisfying fairness constraints on training data.

Abstract

Optimizing prediction accuracy can come at the expense of fairness. Towards minimizing discrimination against a group, fair machine learning algorithms strive to equalize the behavior of a model across different groups, by imposing a fairness constraint on models. However, we show that giving the same importance to groups of different sizes and distributions, to counteract the effect of bias in training data, can be in conflict with robustness. We analyze data poisoning attacks against group-based fair machine learning, with the focus on equalized odds. An adversary who can control sampling or labeling for a fraction of training data, can reduce the test accuracy significantly beyond what he can achieve on unconstrained models. Adversarial sampling and adversarial labeling attacks can also worsen the model's fairness gap on test data, even though the model satisfies the fairness…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

privacytrustlab/adversarial_bias
none

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Ethics and Social Impacts of AI · Explainable Artificial Intelligence (XAI)