TL;DR
This paper investigates data poisoning attacks on regression models, demonstrating their effectiveness in real-world scenarios and proposing a new defense strategy that significantly reduces attack impact across multiple datasets.
Contribution
It introduces a novel black-box poisoning attack for regression and a comprehensive defense strategy, expanding understanding beyond classification-focused studies.
Findings
Poisoning increases MSE by 150% with only 2% poisoned data
The proposed defense effectively mitigates attacks across 26 datasets
Real-world medical case demonstrates attack feasibility
Abstract
Adversarial data poisoning is an effective attack against machine learning and threatens model integrity by introducing poisoned data into the training dataset. So far, it has been studied mostly for classification, even though regression learning is used in many mission critical systems (such as dosage of medication, control of cyber-physical systems and managing power supply). Therefore, in the present research, we aim to evaluate all aspects of data poisoning attacks on regression learning, exceeding previous work both in terms of breadth and depth. We present realistic scenarios in which data poisoning attacks threaten production systems and introduce a novel black-box attack, which is then applied to a real-word medical use-case. As a result, we observe that the mean squared error (MSE) of the regressor increases to 150 percent due to inserting only two percent of poison samples.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
