Suspicion-Free Adversarial Attacks on Clustering Algorithms
Anshuman Chhabra, Abhishek Roy, Prasant Mohapatra

TL;DR
This paper introduces a novel black-box adversarial attack on clustering algorithms, specifically targeting linearly separable clusters, by perturbing a single sample to cause widespread misclustering without detection.
Contribution
It presents the first theoretical and empirical method for spill-over adversarial attacks on clustering that do not require knowledge of the true metric or detection of outliers.
Findings
Attack successfully causes misclustering in multiple datasets
Perturbed samples are not outliers, ensuring stealth
Theoretical proof of attack effectiveness without metric knowledge
Abstract
Clustering algorithms are used in a large number of applications and play an important role in modern machine learning-- yet, adversarial attacks on clustering algorithms seem to be broadly overlooked unlike supervised learning. In this paper, we seek to bridge this gap by proposing a black-box adversarial attack for clustering models for linearly separable clusters. Our attack works by perturbing a single sample close to the decision boundary, which leads to the misclustering of multiple unperturbed samples, named spill-over adversarial samples. We theoretically show the existence of such adversarial samples for the K-Means clustering. Our attack is especially strong as (1) we ensure the perturbed sample is not an outlier, hence not detectable, and (2) the exact metric used for clustering is not known to the attacker. We theoretically justify that the attack can indeed be successful…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
