On Distinctive Properties of Universal Perturbations

Sung Min Park; Kuo-An Wei; Kai Xiao; Jerry Li; Aleksander Madry

arXiv:2112.15329·cs.LG·January 3, 2022

On Distinctive Properties of Universal Perturbations

Sung Min Park, Kuo-An Wei, Kai Xiao, Jerry Li, Aleksander Madry

PDF

Open Access

TL;DR

This paper explores the unique properties of universal adversarial perturbations, revealing their semantic locality, spatial invariance, and reduced reliance on non-robust features compared to standard adversarial attacks.

Contribution

It identifies key properties of UAPs that differentiate them from standard adversarial perturbations, including human-aligned semantic and spatial features.

Findings

01

Targeted UAPs exhibit semantic locality and spatial invariance.

02

UAPs contain less signal for generalization than standard adversarial perturbations.

03

UAPs leverage non-robust features to a smaller extent.

Abstract

We identify properties of universal adversarial perturbations (UAPs) that distinguish them from standard adversarial perturbations. Specifically, we show that targeted UAPs generated by projected gradient descent exhibit two human-aligned properties: semantic locality and spatial invariance, which standard targeted adversarial perturbations lack. We also demonstrate that UAPs contain significantly less signal for generalization than standard adversarial perturbations -- that is, UAPs leverage non-robust features to a smaller extent than standard adversarial perturbations.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Bacillus and Francisella bacterial research