Provable Unrestricted Adversarial Training without Compromise with Generalizability
Lilin Zhang, Ning Yang, Yanchao Sun, Philip S. Yu

TL;DR
This paper introduces PUAT, a novel adversarial training method that defends against unrestricted adversarial examples while maintaining high accuracy on natural data, addressing key limitations of existing approaches.
Contribution
PUAT is a new adversarial training framework that effectively handles unrestricted adversarial examples and improves standard generalizability by aligning data and adversarial distributions.
Findings
PUAT achieves comprehensive robustness against UAE and RAE.
PUAT improves natural data accuracy without sacrificing adversarial robustness.
Theoretical analysis confirms the effectiveness of PUAT.
Abstract
Adversarial training (AT) is widely considered as the most promising strategy to defend against adversarial attacks and has drawn increasing interest from researchers. However, the existing AT methods still suffer from two challenges. First, they are unable to handle unrestricted adversarial examples (UAEs), which are built from scratch, as opposed to restricted adversarial examples (RAEs), which are created by adding perturbations bound by an norm to observed examples. Second, the existing AT methods often achieve adversarial robustness at the expense of standard generalizability (i.e., the accuracy on natural examples) because they make a tradeoff between them. To overcome these challenges, we propose a unique viewpoint that understands UAEs as imperceptibly perturbed unobserved examples. Also, we find that the tradeoff results from the separation of the distributions of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning
MethodsRegularized Autoencoders
