Influence Based Defense Against Data Poisoning Attacks in Online Learning
Sanjay Seetharaman, Shubham Malaviya, Rosni KV, Manish Shukla, Sachin, Lodha

TL;DR
This paper introduces an influence-based defense mechanism for online learning that combines influence functions with data sanitization to mitigate data poisoning attacks, addressing a gap in online settings where defenses are less explored.
Contribution
The work proposes a novel online defense method using influence functions combined with sanitization, specifically designed to counter data poisoning in sequential data arrival scenarios.
Findings
Effective against multiple attack strategies
Reduces model degradation in online learning
Validated on various datasets
Abstract
Data poisoning is a type of adversarial attack on training data where an attacker manipulates a fraction of data to degrade the performance of machine learning model. Therefore, applications that rely on external data-sources for training data are at a significantly higher risk. There are several known defensive mechanisms that can help in mitigating the threat from such attacks. For example, data sanitization is a popular defensive mechanism wherein the learner rejects those data points that are sufficiently far from the set of training instances. Prior work on data poisoning defense primarily focused on offline setting, wherein all the data is assumed to be available for analysis. Defensive measures for online learning, where data points arrive sequentially, have not garnered similar interest. In this work, we propose a defense mechanism to minimize the degradation caused by the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Machine Learning and Data Classification
