TL;DR
This paper reveals that availability attacks exploit linearly separable perturbations as shortcuts, enabling efficient generation of effective attacks and highlighting the prevalence of shortcut learning in deep models.
Contribution
It uncovers the linear separability property of availability attack perturbations and demonstrates synthetic attacks are as effective and easier to generate.
Findings
Availability perturbations are almost linearly separable with target labels.
Synthetic linearly-separable perturbations are as powerful as crafted attacks.
Shortcut learning is more widespread in deep models than previously thought.
Abstract
Availability attacks, which poison the training data with imperceptible perturbations, can make the data \emph{not exploitable} by machine learning algorithms so as to prevent unauthorized use of data. In this work, we investigate why these perturbations work in principle. We are the first to unveil an important population property of the perturbations of these attacks: they are almost \textbf{linearly separable} when assigned with the target labels of the corresponding samples, which hence can work as \emph{shortcuts} for the learning objective. We further verify that linear separability is indeed the workhorse for availability attacks. We synthesize linearly-separable perturbations as attacks and show that they are as powerful as the deliberately crafted attacks. Moreover, such synthetic perturbations are much easier to generate. For example, previous attacks need dozens of hours to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
