TL;DR
This paper introduces a novel subpopulation data poisoning attack that is highly effective and stealthy, challenging existing defenses and highlighting vulnerabilities in machine learning systems with diverse datasets.
Contribution
The paper proposes a new subpopulation poisoning attack framework, demonstrates its effectiveness across datasets and models, and proves its resistance to current defenses.
Findings
Subpopulation attacks are highly effective and stealthy.
Existing defenses are largely ineffective against these attacks.
The attack framework can enhance targeted attack strategies.
Abstract
Machine learning systems are deployed in critical settings, but they might fail in unexpected ways, impacting the accuracy of their predictions. Poisoning attacks against machine learning induce adversarial modification of data used by a machine learning algorithm to selectively change its output when it is deployed. In this work, we introduce a novel data poisoning attack called a \emph{subpopulation attack}, which is particularly relevant when datasets are large and diverse. We design a modular framework for subpopulation attacks, instantiate it with different building blocks, and show that the attacks are effective for a variety of datasets and machine learning models. We further optimize the attacks in continuous domains using influence functions and gradient optimization methods. Compared to existing backdoor poisoning attacks, subpopulation attacks have the advantage of inducing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
