Influence Based Defense Against Data Poisoning Attacks in Online   Learning

Sanjay Seetharaman; Shubham Malaviya; Rosni KV; Manish Shukla; Sachin; Lodha

arXiv:2104.13230·cs.LG·April 28, 2021

Influence Based Defense Against Data Poisoning Attacks in Online Learning

Sanjay Seetharaman, Shubham Malaviya, Rosni KV, Manish Shukla, Sachin, Lodha

PDF

Open Access

TL;DR

This paper introduces an influence-based defense mechanism for online learning that combines influence functions with data sanitization to mitigate data poisoning attacks, addressing a gap in online settings where defenses are less explored.

Contribution

The work proposes a novel online defense method using influence functions combined with sanitization, specifically designed to counter data poisoning in sequential data arrival scenarios.

Findings

01

Effective against multiple attack strategies

02

Reduces model degradation in online learning

03

Validated on various datasets

Abstract

Data poisoning is a type of adversarial attack on training data where an attacker manipulates a fraction of data to degrade the performance of machine learning model. Therefore, applications that rely on external data-sources for training data are at a significantly higher risk. There are several known defensive mechanisms that can help in mitigating the threat from such attacks. For example, data sanitization is a popular defensive mechanism wherein the learner rejects those data points that are sufficiently far from the set of training instances. Prior work on data poisoning defense primarily focused on offline setting, wherein all the data is assumed to be available for analysis. Defensive measures for online learning, where data points arrive sequentially, have not garnered similar interest. In this work, we propose a defense mechanism to minimize the degradation caused by the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Machine Learning and Data Classification