Data Poisoning Attacks against Online Learning
Yizhen Wang, Kamalika Chaudhuri

TL;DR
This paper investigates data poisoning attacks in online learning, formalizing the problem, proposing attack strategies, and evaluating their effectiveness to inform better defenses against such adversarial threats.
Contribution
It introduces a systematic framework for data poisoning in online learning, including formal problem definitions, attack strategies, and extensive experimental analysis.
Findings
Attack strategies can significantly degrade online learning performance.
Proposed methods are effective across various online learning scenarios.
Insights suggest new directions for developing robust defenses.
Abstract
We consider data poisoning attacks, a class of adversarial attacks on machine learning where an adversary has the power to alter a small fraction of the training data in order to make the trained classifier satisfy certain objectives. While there has been much prior work on data poisoning, most of it is in the offline setting, and attacks for online learning, where training data arrives in a streaming manner, are not well understood. In this work, we initiate a systematic investigation of data poisoning attacks for online learning. We formalize the problem into two settings, and we propose a general attack strategy, formulated as an optimization problem, that applies to both with some modifications. We propose three solution strategies, and perform extensive experimental evaluation. Finally, we discuss the implications of our findings for building successful defenses.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Network Security and Intrusion Detection · Advanced Malware Detection Techniques
