Generating Poisoning Attacks against Ridge Regression Models with Categorical Features
Monse Guedes-Ayala, Lars Schewe, Zeynep Suvak, Miguel Anjos

TL;DR
This paper introduces a novel algorithm to generate effective poisoning attacks on ridge regression models with both numerical and categorical features, enhancing understanding of model vulnerabilities.
Contribution
It formulates the poisoning attack as a bilevel optimization problem and proposes a new algorithm specifically targeting categorical features modeled as SOS-1 sets.
Findings
Improves mean squared error across datasets
Outperforms previous benchmarks
Effectively poisons models with categorical features
Abstract
Machine Learning (ML) models have become a very powerful tool to extract information from large datasets and use it to make accurate predictions and automated decisions. However, ML models can be vulnerable to external attacks, causing them to underperform or deviate from their expected tasks. One way to attack ML models is by injecting malicious data to mislead the algorithm during the training phase, which is referred to as a poisoning attack. We can prepare for such situations by designing anticipated attacks, which are later used for creating and testing defence strategies. In this paper, we propose an algorithm to generate strong poisoning attacks for a ridge regression model containing both numerical and categorical features that explicitly models and poisons categorical features. We model categorical features as SOS-1 sets and formulate the problem of designing poisoning attacks…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
