On Robustness of Linear Classifiers to Targeted Data Poisoning
Nakshatra Gupta, Sumanth Prabhu, Supratik Chakraborty, R Venkatesh

TL;DR
This paper investigates the robustness of linear classifiers against targeted data poisoning attacks, proving the problem's computational hardness and proposing an efficient bounding technique that effectively assesses dataset vulnerability.
Contribution
It introduces a novel method to compute bounds on dataset robustness to label-flip poisoning attacks for linear classifiers, addressing computational challenges.
Findings
The robustness problem is NP-Complete for linear classifiers.
The proposed technique efficiently computes bounds on dataset robustness.
Poisoning beyond bounds significantly affects classification accuracy.
Abstract
Data poisoning is a training-time attack that undermines the trustworthiness of learned models. In a targeted data poisoning attack, an adversary manipulates the training dataset to alter the classification of a targeted test point. Given the typically large size of training dataset, manual detection of poisoning is difficult. An alternative is to automatically measure a dataset's robustness against such an attack, which is the focus of this paper. We consider a threat model wherein an adversary can only perturb the labels of the training dataset, with knowledge limited to the hypothesis space of the victim's model. In this setting, we prove that finding the robustness is an NP-Complete problem, even when hypotheses are linear classifiers. To overcome this, we present a technique that finds lower and upper bounds of robustness. Our implementation of the technique computes these bounds…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Privacy-Preserving Technologies in Data · Smart Grid Security and Resilience
