Data Poisoning Attacks on Regression Learning and Corresponding Defenses

Nicolas Michael M\"uller; Daniel Kowatsch; Konstantin B\"ottinger

arXiv:2009.07008·cs.LG·September 16, 2020

Data Poisoning Attacks on Regression Learning and Corresponding Defenses

Nicolas Michael M\"uller, Daniel Kowatsch, Konstantin B\"ottinger

PDF

2 Repos

TL;DR

This paper investigates data poisoning attacks on regression models, demonstrating their effectiveness in real-world scenarios and proposing a new defense strategy that significantly reduces attack impact across multiple datasets.

Contribution

It introduces a novel black-box poisoning attack for regression and a comprehensive defense strategy, expanding understanding beyond classification-focused studies.

Findings

01

Poisoning increases MSE by 150% with only 2% poisoned data

02

The proposed defense effectively mitigates attacks across 26 datasets

03

Real-world medical case demonstrates attack feasibility

Abstract

Adversarial data poisoning is an effective attack against machine learning and threatens model integrity by introducing poisoned data into the training dataset. So far, it has been studied mostly for classification, even though regression learning is used in many mission critical systems (such as dosage of medication, control of cyber-physical systems and managing power supply). Therefore, in the present research, we aim to evaluate all aspects of data poisoning attacks on regression learning, exceeding previous work both in terms of breadth and depth. We present realistic scenarios in which data poisoning attacks threaten production systems and introduce a novel black-box attack, which is then applied to a real-word medical use-case. As a result, we observe that the mean squared error (MSE) of the regressor increases to 150 percent due to inserting only two percent of poison samples.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.