Provable Unrestricted Adversarial Training without Compromise with   Generalizability

Lilin Zhang; Ning Yang; Yanchao Sun; Philip S. Yu

arXiv:2301.09069·cs.LG·May 21, 2024·1 cites

Provable Unrestricted Adversarial Training without Compromise with Generalizability

Lilin Zhang, Ning Yang, Yanchao Sun, Philip S. Yu

PDF

Open Access

TL;DR

This paper introduces PUAT, a novel adversarial training method that defends against unrestricted adversarial examples while maintaining high accuracy on natural data, addressing key limitations of existing approaches.

Contribution

PUAT is a new adversarial training framework that effectively handles unrestricted adversarial examples and improves standard generalizability by aligning data and adversarial distributions.

Findings

01

PUAT achieves comprehensive robustness against UAE and RAE.

02

PUAT improves natural data accuracy without sacrificing adversarial robustness.

03

Theoretical analysis confirms the effectiveness of PUAT.

Abstract

Adversarial training (AT) is widely considered as the most promising strategy to defend against adversarial attacks and has drawn increasing interest from researchers. However, the existing AT methods still suffer from two challenges. First, they are unable to handle unrestricted adversarial examples (UAEs), which are built from scratch, as opposed to restricted adversarial examples (RAEs), which are created by adding perturbations bound by an $l_{p}$ norm to observed examples. Second, the existing AT methods often achieve adversarial robustness at the expense of standard generalizability (i.e., the accuracy on natural examples) because they make a tradeoff between them. To overcome these challenges, we propose a unique viewpoint that understands UAEs as imperceptibly perturbed unobserved examples. Also, we find that the tradeoff results from the separation of the distributions of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning

MethodsRegularized Autoencoders