Towards Fair Classification against Poisoning Attacks

Han Xu; Xiaorui Liu; Yuxuan Wan; Jiliang Tang

arXiv:2210.09503·cs.LG·October 19, 2022

Towards Fair Classification against Poisoning Attacks

Han Xu, Xiaorui Liu, Yuxuan Wan, Jiliang Tang

PDF

Open Access

TL;DR

This paper reveals that fair classification models are highly vulnerable to poisoning attacks and proposes a theoretically grounded defense framework that improves robustness in accuracy and fairness.

Contribution

It introduces a general defense framework for fair classification against poisoning attacks, extending traditional defenses with theoretical guarantees.

Findings

01

Fair classifiers are vulnerable to poisoning attacks affecting fairness and accuracy.

02

The proposed defense framework outperforms baseline methods in robustness.

03

Extensive experiments validate the effectiveness of the proposed approach.

Abstract

Fair classification aims to stress the classification models to achieve the equality (treatment or prediction quality) among different sensitive groups. However, fair classification can be under the risk of poisoning attacks that deliberately insert malicious training samples to manipulate the trained classifiers' performance. In this work, we study the poisoning scenario where the attacker can insert a small fraction of samples into training data, with arbitrary sensitive attributes as well as other predictive features. We demonstrate that the fairly trained classifiers can be greatly vulnerable to such poisoning attacks, with much worse accuracy & fairness trade-off, even when we apply some of the most effective defenses (originally proposed to defend traditional classification tasks). As countermeasures to defend fair classification tasks, we propose a general and theoretically…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning