Gradient-based Data Subversion Attack Against Binary Classifiers
Rosni K Vasu, Sanjay Seetharaman, Shubham Malaviya, Manish Shukla,, Sachin Lodha

TL;DR
This paper introduces gradient-based label contamination methods to efficiently degrade binary classifier performance with limited attacker knowledge, highlighting vulnerabilities in machine learning systems.
Contribution
It proposes novel gradient-based strategies for label poisoning attacks that are effective even with limited knowledge of the target model.
Findings
Outperforms baseline attack methods in effectiveness
Efficient in computational resources
Demonstrates vulnerability of binary classifiers to label contamination
Abstract
Machine learning based data-driven technologies have shown impressive performances in a variety of application domains. Most enterprises use data from multiple sources to provide quality applications. The reliability of the external data sources raises concerns for the security of the machine learning techniques adopted. An attacker can tamper the training or test datasets to subvert the predictions of models generated by these techniques. Data poisoning is one such attack wherein the attacker tries to degrade the performance of a classifier by manipulating the training data. In this work, we focus on label contamination attack in which an attacker poisons the labels of data to compromise the functionality of the system. We develop Gradient-based Data Subversion strategies to achieve model degradation under the assumption that the attacker has limited-knowledge of the victim model. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Advanced Malware Detection Techniques · Network Security and Intrusion Detection
