Poisoning Attacks on Fair Machine Learning
Minh-Hao Van, Wei Du, Xintao Wu, Aidong Lu

TL;DR
This paper introduces a versatile poisoning attack framework targeting fair machine learning models, capable of degrading both accuracy and fairness across various fairness notions through three online attack methods.
Contribution
It presents the first comprehensive poisoning attack framework specifically designed for fair machine learning models, adaptable to multiple fairness criteria.
Findings
Effective reduction in model accuracy and fairness violations.
Three online attack methods successfully generate poisoning samples.
Framework demonstrates high efficiency and adaptability in experiments.
Abstract
Both fair machine learning and adversarial learning have been extensively studied. However, attacking fair machine learning models has received less attention. In this paper, we present a framework that seeks to effectively generate poisoning samples to attack both model accuracy and algorithmic fairness. Our attacking framework can target fair machine learning models trained with a variety of group based fairness notions such as demographic parity and equalized odds. We develop three online attacks, adversarial sampling , adversarial labeling, and adversarial feature modification. All three attacks effectively and efficiently produce poisoning samples via sampling, labeling, or modifying a fraction of training data in order to reduce the test accuracy. Our framework enables attackers to flexibly adjust the attack's focus on prediction accuracy or fairness and accurately quantify the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Ethics and Social Impacts of AI · Explainable Artificial Intelligence (XAI)
MethodsTest
