TL;DR
This paper introduces a novel, data-free method for creating universal adversarial perturbations that can fool multiple vision models across various tasks without needing training data, highlighting increased risks for deep learning models.
Contribution
The paper proposes a generalizable, data-free approach for crafting universal adversarial perturbations that work across multiple vision tasks and in black-box scenarios, outperforming data-dependent methods.
Findings
Achieves high fooling rates across multiple vision tasks.
Outperforms data-dependent objectives in black-box settings.
Generalizes without requiring training data.
Abstract
Machine learning models are susceptible to adversarial perturbations: small changes to input that can cause large changes in output. It is also demonstrated that there exist input-agnostic perturbations, called universal adversarial perturbations, which can change the inference of target model on most of the data samples. However, existing methods to craft universal perturbations are (i) task specific, (ii) require samples from the training data distribution, and (iii) perform complex optimizations. Additionally, because of the data dependence, fooling ability of the crafted perturbations is proportional to the available training data. In this paper, we present a novel, generalizable and data-free approaches for crafting universal adversarial perturbations. Independent of the underlying task, our objective achieves fooling via corrupting the extracted features at multiple layers.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
