Adversarial Robustness: What fools you makes you stronger
Grzegorz G{\l}uch, R\"udiger Urbanke

TL;DR
This paper demonstrates an exponential sample complexity separation between standard PAC learning and a specialized equivalence-query model, and applies this to develop an adaptive, provably robust adversarial defense scheme.
Contribution
It introduces a novel exponential separation result and proposes a new adaptive adversarial defense approach that is provably robust against a strong adversary.
Findings
Existence of an efficient adversarial-learning scheme that resists strong attacks or achieves low error
The scheme uses exponentially fewer samples than traditional PAC bounds
Theoretical proof of sample complexity advantages in adversarial robustness
Abstract
We prove an exponential separation for the sample complexity between the standard PAC-learning model and a version of the Equivalence-Query-learning model. We then show that this separation has interesting implications for adversarial robustness. We explore a vision of designing an adaptive defense that in the presence of an attacker computes a model that is provably robust. In particular, we show how to realize this vision in a simplified setting. In order to do so, we introduce a notion of a strong adversary: he is not limited by the type of perturbations he can apply but when presented with a classifier can repetitively generate different adversarial examples. We explain why this notion is interesting to study and use it to prove the following. There exists an efficient adversarial-learning-like scheme such that for every strong adversary it outputs a classifier that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning
