Learning to Generate Image Source-Agnostic Universal Adversarial Perturbations
Pu Zhao, Parikshit Ram, Songtao Lu, Yuguang Yao, Djallel Bouneffouf,, Xue Lin, Sijia Liu

TL;DR
This paper introduces a novel few-shot learning approach using bilevel optimization and learning-to-optimize techniques to generate universal adversarial perturbations effective across diverse image sources and models.
Contribution
It proposes a new meta-learning framework combining MAML and L2O for improved UAP generation across multiple image sources and models.
Findings
50% higher attack success rate than baselines
37% faster UAP generation than existing methods
Effective across different image sources and victim models
Abstract
Adversarial perturbations are critical for certifying the robustness of deep learning models. A universal adversarial perturbation (UAP) can simultaneously attack multiple images, and thus offers a more unified threat model, obviating an image-wise attack algorithm. However, the existing UAP generator is underdeveloped when images are drawn from different image sources (e.g., with different image resolutions). Towards an authentic universality across image sources, we take a novel view of UAP generation as a customized instance of few-shot learning, which leverages bilevel optimization and learning-to-optimize (L2O) techniques for UAP generation with improved attack success rate (ASR). We begin by considering the popular model agnostic meta-learning (MAML) framework to meta-learn a UAP generator. However, we see that the MAML framework does not directly offer the universal attack across…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Bacillus and Francisella bacterial research
MethodsStochastic Gradient Descent · Model-Agnostic Meta-Learning
