On the Vulnerability of Fairness Constrained Learning to Malicious Noise
Avrim Blum, Princewill Okoroafor, Aadirupa Saha, Kevin Stangl

TL;DR
This paper investigates how randomized classifiers can mitigate the impact of malicious noise on fairness-constrained learning, showing nuanced bounds on accuracy loss for various fairness notions.
Contribution
It demonstrates that allowing randomization in classifiers leads to tighter bounds on accuracy loss under malicious noise, contrasting prior results for proper learners.
Findings
For Demographic Parity, loss is Θ(α), matching unconstrained bounds.
For Equal Opportunity, loss is Θ(√α), with matching lower bounds.
Additional fairness notions show three regimes: O(α), O(√α), and O(1).
Abstract
We consider the vulnerability of fairness-constrained learning to small amounts of malicious noise in the training data. Konstantinov and Lampert (2021) initiated the study of this question and presented negative results showing there exist data distributions where for several fairness constraints, any proper learner will exhibit high vulnerability when group sizes are imbalanced. Here, we present a more optimistic view, showing that if we allow randomized classifiers, then the landscape is much more nuanced. For example, for Demographic Parity we show we can incur only a loss in accuracy, where is the malicious noise rate, matching the best possible even without fairness constraints. For Equal Opportunity, we show we can incur an loss, and give a matching lower bound. In contrast, Konstantinov and Lampert (2021) showed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Ethics and Social Impacts of AI · Explainable Artificial Intelligence (XAI)
